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Introduction 


The emphasis in this course is on problems—doing calculations and story problems. 'To 


master problem solving one needs a tremendous amount of practice doing problems. The 


more problems you do the better you will be at doing them, as patterns will start to emerge 


in both the problems and in successful approaches to them. You will learn fastest and best 


if you devote some time to doing problems every day. 


Typically the most difficult problems are story problems, since they require some effort 


before you can begin calculating. Here are some pointers for doing story problems: 


1. 
2. 


Carefully read each problem twice before writing anything. 


Assign letters to quantities that are described only in words; draw a diagram if 


appropriate. 


. Decide which letters are constants and which are variables. A letter stands for a 


constant if its value remains the same throughout the problem. 


. Using mathematical notation, write down what you know and then write down 


what you want to find. 


. Decide what category of problem it is (this might be obvious if the problem comes 


at the end of a particular chapter, but will not necessarily be so obvious if it comes 


on an exam covering several chapters). 


. Double check each step as you go along; don’t wait until the end to check your 


work. 


. Use common sense; if an answer is out of the range of practical possibilities, then 


check your work to see where you went wrong. 
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Introduction 


Suggestions for Using This Text 


1. Read the example problems carefully, filling in any steps that are left out (ask 


someone for help if you can’t follow the solution to a worked example). 


2. Later use the worked examples to study by covering the solutions, and seeing if 


you can solve the problems on your own. 


. Most exercises have answers in Appendix A; the availability of an answer is marked 


by “=” at the end of the exercise. In the pdf version of the full text, clicking 
on the arrow will take you to the answer. The answers should be used only as 
a final check on your work, not as a crutch. Keep in mind that sometimes an 
answer could be expressed in various ways that are algebraically equivalent, so 
don’t assume that your answer is wrong just because it doesn’t have exactly the 
same form as the answer in the back. 


. A few figures in the pdf and print versions of the book are marked with “(AP)” at 


the end of the caption. Clicking on this should open a related interactive applet 
or Sage worksheet in your web browser. Occasionally another link will do the 
same thing, like this example. (Note to users of a printed text: the words “this 
example” in the pdf file are blue, and are a link to a Sage worksheet.) 


1 


Analytic Geometry 


Much of the mathematics in this chapter will be review for you. However, the examples 
will be oriented toward applications and so will take some thought. 

In the (x, y) coordinate system we normally write the x-axis horizontally, with positive 
numbers to the right of the origin, and the y-axis vertically, with positive numbers above 
the origin. That is, unless stated otherwise, we take “rightward” to be the positive x- 
direction and “upward” to be the positive y-direction. In a purely mathematical situation, 
we normally choose the same scale for the x- and y-axes. For example, the line joining the 
origin to the point (a,a) makes an angle of 45° with the z-axis (and also with the y-axis). 

In applications, often letters other than x and y are used, and often different scales are 
chosen in the horizontal and vertical directions. For example, suppose you drop something 
from a window, and you want to study how its height above the ground changes from 
second to second. It is natural to let the letter t denote the time (the number of seconds 
since the object was released) and to let the letter h denote the height. For each t (say, 
at one-second intervals) you have a corresponding height h. This information can be 
tabulated, and then plotted on the (t,h) coordinate plane, as shown in figure 1.0.1. 

We use the word “quadrant” for each of the four regions into which the plane is 
divided by the axes: the first quadrant is where points have both coordinates positive, 
or the “northeast” portion of the plot, and the second, third, and fourth quadrants are 
counted off counterclockwise, so the second quadrant is the northwest, the third is the 
southwest, and the fourth is the southeast. 

Suppose we have two points A and B in the (z, y)-plane. We often want to know the 
change in x-coordinate (also called the “horizontal distance”) in going from A to B. This 
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seconds 0 1 2 3 4 
meters 80 75.1 60.4 35.9 1.6 


80 


Figure 1.0.1 A data plot, height versus time. 


is often written Ax, where the meaning of A (a capital delta in the Greek alphabet) is 
“change in”. (Thus, Az can be read as “change in x” although it usually is read as “delta 
x”. The point is that Ax denotes a single number, and should not be interpreted as “delta 
times x”.) For example, if A = (2,1) and B = (3,3), Ag = 3-—2 = 1. Similarly, the 
“change in y” is written Ay. In our example, Ay = 3 — 1 = 2, the difference between the 
y-coordinates of the two points. It is the vertical distance you have to move in going from 
A to B. The general formulas for the change in x and the change in y between a point 
(v1, y1) and a point (2, y2) are: 


Ag =i9 = 215 Ay = y2—- 1. 


Note that either or both of these might be negative. 


1.1 LINES 


If we have two points A(x1,y,) and B(x, y2), then we can draw one and only one line 
through both points. By the slope of this line we mean the ratio of Ay to Az. The slope 
is often denoted m: m = Ay/Ax = (y2 — y1)/(a2 — 41). For example, the line joining the 
points (1, —2) and (3,5) has slope (5 + 2)/(3 — 1) = 7/2. 


EXAMPLE 1.1.1 According to the 1990 U.S. federal income tax schedules, a head 
of household paid 15% on taxable income up to $26050. If taxable income was between 
$26050 and $134930, then, in addition, 28% was to be paid on the amount between $26050 
and $67200, and 33% paid on the amount over $67200 (if any). Interpret the tax bracket 
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information (15%, 28%, or 33%) using mathematical terminology, and graph the tax on 
the y-axis against the taxable income on the x-axis. 

The percentages, when converted to decimal values 0.15, 0.28, and 0.33, are the slopes 
of the straight lines which form the graph of the tax for the corresponding tax brackets. 
The tax graph is what’s called a polygonal line, i.e., it’s made up of several straight line 
segments of different slopes. The first line starts at the point (0,0) and heads upward 
with slope 0.15 (i-e., it goes upward 15 for every increase of 100 in the x-direction), until 
it reaches the point above x = 26050. Then the graph “bends upward,” i.e., the slope 
changes to 0.28. As the horizontal coordinate goes from x = 26050 to x = 67200, the line 
goes upward 28 for each 100 in the x-direction. At x = 67200 the line turns upward again 


and continues with slope 0.33. See figure 1.1.1. 


30000 4 


20000 4 


10000 +; 


50000 100000 134930 


Figure 1.1.1 Tax vs. income. 


The most familiar form of the equation of a straight line is: y = mx+0. Here m is the 
slope of the line: if you increase x by 1, the equation tells you that you have to increase y 
by m. If you increase x by Az, then y increases by Ay = mAz. The number 0 is called 
the y-intercept, because it is where the line crosses the y-axis. If you know two points 
on a line, the formula m = (y2 — y1)/(%2 — 21) gives you the slope. Once you know a point 
and the slope, then the y-intercept can be found by substituting the coordinates of either 
point in the equation: y, = mz, +b, ie., b= y; — mz ,. Alternatively, one can use the 
“point-slope” form of the equation of a straight line: start with (y— y,)/(a—2x1) =m and 
then multiply to get (y — y1) = m(az — 21), the point-slope form. Of course, this may be 
further manipulated to get y = mx — mx, + y1, which is essentially the “ma + b” form. 

It is possible to find the equation of a line between two points directly from the relation 
(y—y1)/(@ — 21) = (y2 — y1)/(*2 — £1), which says “the slope measured between the point 
(v1, y1) and the point (x2, y2) is the same as the slope measured between the point (71, yi) 
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and any other point (x,y) on the line.” For example, if we want to find the equation of 
the line joining our earlier points A(2,1) and B(3,3), we can use this formula: 


y-1_ 3-1 


5 a so that y—1=2(x-2), 15 y= 22 = 3. 


Of course, this is really just the point-slope formula, except that we are not computing m 
in a separate step. 

The slope m of a line in the form y = mz + b tells us the direction in which the line is 
pointing. If m is positive, the line goes into the 1st quadrant as you go from left to right. 
If m is large and positive, it has a steep incline, while if m is small and positive, then the 
line has a small angle of inclination. If m is negative, the line goes into the 4th quadrant 
as you go from left to right. If m is a large negative number (large in absolute value), then 
the line points steeply downward; while if m is negative but near zero, then it points only 
a little downward. These four possibilities are illustrated in figure 1.1.2. 


4 4 4 4 
2 2 2 2 
0 0 0) 0 
2 2 2 2 
4 4 4 4 
-4-2 0 2 4 —-4-2 0 2 4 —-4-2 0 2 4 —-4-2 0 2 4 


Figure 1.1.2 Lines with slopes 3, 0.1, —4, and —0.1. 


If m = 0, then the line is horizontal: its equation is simply y = b. 

There is one type of line that cannot be written in the form y = mz + b, namely, 
vertical lines. A vertical line has an equation of the form x = a. Sometimes one says that 
a vertical line has an “infinite” slope. 

Sometimes it is useful to find the x-intercept of a line y = mx +b. This is the x-value 
when y = 0. Setting mx + b equal to 0 and solving for x gives: x = —b/m. For example, 
the line y = 2x — 3 through the points A(2,1) and B(3,3) has x-intercept 3/2. 


EXAMPLE 1.1.2 Suppose that you are driving to Seattle at constant speed, and notice 
that after you have been traveling for 1 hour (i.e., £ = 1), you pass a sign saying it is 110 
miles to Seattle, and after driving another half-hour you pass a sign saying it is 85 miles 
to Seattle. Using the horizontal axis for the time t and the vertical axis for the distance y 
from Seattle, graph and find the equation y = mt + b for your distance from Seattle. Find 
the slope, y-intercept, and t-intercept, and describe the practical meaning of each. 

The graph of y versus t¢ is a straight line because you are traveling at constant speed. 
The line passes through the two points (1,110) and (1.5, 85), so its slope is m = (85 — 
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110)/(1.5 — 1) = —50. The meaning of the slope is that you are traveling at 50 mph; m is 
negative because you are traveling toward Seattle, i.e., your distance y is decreasing. The 
word “velocity” is often used for m= —50, when we want to indicate direction, while the 
word “speed” refers to the magnitude (absolute value) of velocity, which is 50 mph. To 
find the equation of the line, we use the point-slope formula: 


— 110 
-— =-50,  sothat y= —50(t—1) +110 = —50t +160. 


The meaning of the y-intercept 160 is that when t = 0 (when you started the trip) you were 
160 miles from Seattle. To find the t-intercept, set 0 = —50¢+160, so that t = 160/50 = 3.2. 


The meaning of the t-intercept is the duration of your trip, from the start until you arrive 
in Seattle. After traveling 3 hours and 12 minutes, your distance y from Seattle will be 0. 


Exercises 1.1. 


Find the equation of the line through (1,1) and (—5, —3) in the form y= ma+b. > 
Find the equation of the line through (—1,2) with slope —2 in the form y = maz +b. > 


Find the equation of the line through (—1, 1) and (5, —3) in the form y= maz+b. > 


So 


Change the equation y — 2x = 2 to the form y = ma + b, graph the line, and find the 
y-intercept and x-intercept. > 


5. Change the equation x+y = 6 to the form y = ma-+b, graph the line, and find the y-intercept 
and x-intercept. > 


6. Change the equation x = 2y — 1 to the form y = ma -+ }, graph the line, and find the 
y-intercept and x-intercept. => 


7. Change the equation 3 = 2y to the form y = mx +), graph the line, and find the y-intercept 
and x-intercept. > 


8. Change the equation 27 + 3y + 6 = 0 to the form y = mz + b, graph the line, and find the 
y-intercept and x-intercept. > 


9. Determine whether the lines 32 + 6y = 7 and 2a + 4y = 5 are parallel. > 


10. Suppose a triangle in the x, y—plane has vertices (—1,0), (1,0) and (0,2). Find the equations 
of the three lines that lie along the sides of the triangle in y = max + b form. > 


11. Suppose that you are driving to Seattle at constant speed. After you have been traveling 
for an hour you pass a sign saying it is 130 miles to Seattle, and after driving another 20 
minutes you pass a sign saying it is 105 miles to Seattle. Using the horizontal axis for the 
time ¢ and the vertical axis for the distance y from your starting point, graph and find the 
equation y = mt + b for your distance from your starting point. How long does the trip to 
Seattle take? => 


12. Let x stand for temperature in degrees Celsius (centigrade), and let y stand for temperature in 
degrees Fahrenheit. A temperature of 0°C corresponds to 32°F, and a temperature of 100°C 
corresponds to 212°F. Find the equation of the line that relates temperature Fahrenheit y to 
temperature Celsius x in the form y = ma + b. Graph the line, and find the point at which 
this line intersects y = x. What is the practical meaning of this point? => 
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A car rental firm has the following charges for a certain type of car: $25 per day with 100 
free miles included, $0.15 per mile for more than 100 miles. Suppose you want to rent a 
car for one day, and you know you’ll use it for more than 100 miles. What is the equation 
relating the cost y to the number of miles x that you drive the car? > 


A photocopy store advertises the following prices: 5¢ per copy for the first 20 copies, 4¢ per 
copy for the 21st through 100th copy, and 3¢ per copy after the 100th copy. Let x be the 
number of copies, and let y be the total cost of photocopying. (a) Graph the cost as x goes 
from 0 to 200 copies. (b) Find the equation in the form y = ma + 6 that tells you the cost 
of making x copies when zx is more than 100. > 


In the Kingdom of Xyg the tax system works as follows. Someone who earns less than 100 
gold coins per month pays no tax. Someone who earns between 100 and 1000 gold coins 
pays tax equal to 10% of the amount over 100 gold coins that he or she earns. Someone 
who earns over 1000 gold coins must hand over to the King all of the money earned over 
1000 in addition to the tax on the first 1000. (a) Draw a graph of the tax paid y versus the 
money earned x, and give formulas for y in terms of x in each of the regions 0 < x < 100, 
100 < a < 1000, and x > 1000. (b) Suppose that the King of Xyg decides to use the second 
of these line segments (for 100 < x < 1000) for x < 100 as well. Explain in practical terms 
what the King is doing, and what the meaning is of the y-intercept. > 


The tax for a single taxpayer is described in the figure 1.1.3. Use this information to graph 
tax versus taxable income (i.e., x is the amount on Form 1040, line 37, and y is the amount on 
Form 1040, line 38). Find the slope and y-intercept of each line that makes up the polygonal 
graph, up to x = 97620. > 


1990 Tax Rate Schedules 


Schedule X—Use if your filing status is Schedule Z—vse if your filing status is 
Single Head of household 
If the amount Enter on of the If the amount Enter on of the 
on Form 1040 But not Form 1040 amount | on Form 1040 But not Form 1040 amount 
line 37 is over: over: line 38 over: line 37 is over: over: line 38 over: 
$0 $19,450 15% $0 $0 $26,050 15% $0 
19,450 47,050 $2,917.50+28% 19,450 26,050 67,200 $3,907.50+28% 26,050 
47,050 97,620 $10,645.50+33% 47,050 67,200 134,930 $15,429.50+33% 67,200 
Use Worksheet Use Worksheet 
975620. wsseesareses below to figure 134,930!) asscestasass below to figure 
your tax your tax 


17. 


18. 


Figure 1.1.3. Tax Schedule. 


Market research tells you that if you set the price of an item at $1.50, you will be able to sell 
5000 items; and for every 10 cents you lower the price below $1.50 you will be able to sell 
another 1000 items. Let x be the number of items you can sell, and let P be the price of an 
item. (a) Express P linearly in terms of x, in other words, express P in the form P = ma+b. 
(b) Express x linearly in terms of P. > 


An instructor gives a 100-point final exam, and decides that a score 90 or above will be a 
grade of 4.0, a score of 40 or below will be a grade of 0.0, and between 40 and 90 the grading 
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will be linear. Let x be the exam score, and let y be the corresponding grade. Find a formula 
of the form y = maz + b which applies to scores « between 40 and 90. > 


1.2 DISTANCE BETWEEN Two POINTS; CIRCLES 


Given two points (21, y) and (£9, y2), recall that their horizontal distance from one another 
is Ax = x2—2, and their vertical distance from one another is Ay = y2—y1. (Actually, the 
word “distance” normally denotes “positive distance”. Az and Ay are signed distances, 
but this is clear from context.) The actual (positive) distance from one point to the other 
is the length of the hypotenuse of a right triangle with legs |Az| and |Ay|, as shown in 
figure 1.2.1. The Pythagorean theorem then says that the distance between the two points 
is the square root of the sum of the squares of the horizontal and vertical sides: 


distance = \/(Azx)? + (Ay)? = \/(x2 — 21)? + (yo — y1)?. 


For example, the distance between points A(2, 1) and B(3, 3) is \/(3 — 2)? + (3 — 1)? = V5. 


(x2, ye) 


Ay 


(24; y1) 
Ax 


Figure 1.2.1 Distance between two points, Az and Ay positive. 


As a special case of the distance formula, suppose we want to know the distance of a 


point (x,y) to the origin. According to the distance formula, this is \/(2 — 0)? + (y — 0)? = 
A point (a,y) is at a distance r from the origin if and only if \/x2 + y? =r, or, if we 
square both sides: x? + y? = r?. This is the equation of the circle of radius r centered at 
the origin. The special case r = 1 is called the unit circle; its equation is x? + y? = 1. 
Similarly, if C(h,k) is any fixed point, then a point (x,y) is at a distance r from the 
point C if and only if \/(@ —h)? + (y—k)? =7, ie, if and only if 


(x —h)? + (y—k)? =r. 


This is the equation of the circle of radius r centered at the point (h,k). For example, the 
circle of radius 5 centered at the point (0,—6) has equation (a — 0)? + (y — —6)? = 25, or 
x? +(y+6)? = 25. If we expand this we get 27+ y?+12y+36 = 25 or x7+y7+12y+11 =0, 
but the original form is usually more useful. 
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EXAMPLE 1.2.1 Graph the circle x? — 22 + y? + 4y — 11 = 0. With a little thought 
we convert this to (x — 1)? + (y + 2)? — 16 = 0 or (2 — 1)? + (y+ 2)? = 16. Now we see 
that this is the circle with radius 4 and center (1, —2), which is easy to graph. 


Exercises 1.2. 


1. Find the equation of the circle of radius 3 centered at: 


a) (0,0) d) (0,3) 

b) (5,6) e) (0,—3) 

c) (—5, —6) f) (3,0) 
=> 


2. For each pair of points A(a1,y1) and B(x2, y2) find (i) Av and Ay in going from A to B, 
(ii) the slope of the line joining A and B, (iii) the equation of the line joining A and B in 
the form y = mz + b, (iv) the distance from A to B, and (v) an equation of the circle with 
center at A that goes through B. 


a) A(2,0), B(4,3) d) A(—2,3), B(4,3) 

b) A(1,—1), B(0, 2) e) A(—3,—2), B(0,0) 

c) A(0,0), B(—2,—2) f) A(0.01, 0.01), B(—0.01, 0.05) 
=> 


. Graph the circle x? + y? + 10y = 0. 
. Graph the circle x? — 10x + y? = 24. 
. Graph the circle x? — 6x + y? — 8y = 0. 


aoa . Ww 


. Find the standard equation of the circle passing through (—2,1) and tangent to the line 
3x — 2y = 6 at the point (4,3). Sketch. (Hint: The line through the center of the circle and 
the point of tangency is perpendicular to the tangent line.) > 


1.3. FUNCTIONS 


A function y = f(x) is a rule for determining y when we’re given a value of x. For 
example, the rule y = f(x) = 2x +1 is a function. Any line y = maz -+ b is called a 
linearfunction. The graph of a function looks like a curve above (or below) the z-axis, 
where for any value of x the rule y = f(z) tells us how far to go above (or below) the 
x-axis to reach the curve. 

Functions can be defined in various ways: by an algebraic formula or several algebraic 
formulas, by a graph, or by an experimentally determined table of values. (In the latter 
case, the table gives a bunch of points in the plane, which we might then interpolate with 
a smooth curve, if that makes sense.) 

Given a value of x, a function must give at most one value of y. Thus, vertical lines 
are not functions. For example, the line x = 1 has infinitely many values of y if x = 1. It 


1.3. Functions 21 


is also true that if x is any number not 1 there is no y which corresponds to x, but that is 
not a problem—only multiple y values is a problem. 

In addition to lines, another familiar example of a function is the parabola y = f(x) = 
x”. We can draw the graph of this function by taking various values of x (say, at regular 
intervals) and plotting the points (x, f(x)) = (z,x7). Then connect the points with a 
smooth curve. (See figure 1.3.1.) 


2 are both functions which 


The two examples y = f(r) = 2x + land y= f(r) = 2 
can be evaluated at any value of x from negative infinity to positive infinity. For many 
functions, however, it only makes sense to take x in some interval or outside of some 


“forbidden” region. The interval of x-values at which we’re allowed to evaluate the function 


aa 


is called the domain of the function. 


~= fo =z y=f(e)=Vz y= f(x) =1/2 
Figure 1.3.1 Some graphs. 
For example, the square-root function y = f(x) = \/z is the rule which says, given an 


x-value, take the nonnegative number whose square is x. This rule only makes sense if x 
is positive or zero. We say that the domain of this function is x > 0, or more formally 


{x €R|a > 0}. Alternately, we can use interval notation, and write that the domain is 
(0,00). (In interval notation, square brackets mean that the endpoint is included, and a 
parenthesis means that the endpoint is not included.) The fact that the domain of y = /x 
is [0,oo) means that in the graph of this function ((see figure 1.3.1) we have points (z, y) 
only above x-values on the right side of the x-axis. 

Another example of a function whose domain is not the entire z-axis is: y = f(x) = 
1/ax, the reciprocal function. We cannot substitute « = 0 in this formula. The function 
makes sense, however, for any nonzero x, so we take the domain to be: {7 € R | x F O}. 
The graph of this function does not have any point (x,y) with x = 0. As x gets close to 
O from either side, the graph goes off toward infinity. We call the vertical line x = 0 an 
asymptote. 

To summarize, two reasons why certain x-values are excluded from the domain of a 


function are that (i) we cannot divide by zero, and (ii) we cannot take the square root 
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of a negative number. We will encounter some other ways in which functions might be 
undefined later. 

Another reason why the domain of a function might be restricted is that in a given 
situation the z-values outside of some range might have no practical meaning. For example, 
if y is the area of a square of side x, then we can write y = f(x) = x7. In a purely 
mathematical context the domain of the function y = 2? is all of R. But in the story- 
problem context of finding areas of squares, we restrict the domain to positive values of x, 
because a square with negative or zero side makes no sense. 

In a problem in pure mathematics, we usually take the domain to be all values of x 
at which the formulas can be evaluated. But in a story problem there might be further 
restrictions on the domain because only certain values of x are of interest or make practical 
sense. 

In a story problem, often letters different from x and y are used. For example, the 
volume V of a sphere is a function of the radius r, given by the formula V = f(r) = 4/37r°. 
Also, letters different from f may be used. For example, if y is the velocity of something 
at time t, we may write y = u(t) with the letter v (instead of f) standing for the velocity 
function (and t playing the role of x). 

The letter playing the role of x is called the independent variable, and the letter 
playing the role of y is called the dependent variable (because its value “depends on” 
the value of the independent variable). In story problems, when one has to translate from 
English into mathematics, a crucial step is to determine what letters stand for variables. 
If only words and no letters are given, then we have to decide which letters to use. Some 
letters are traditional. For example, almost always, t stands for time. 


EXAMPLE 1.3.1 An open-top box is made from an a xb rectangular piece of cardboard 
by cutting out a square of side x from each of the four corners, and then folding the sides 
up and sealing them with duct tape. Find a formula for the volume V of the box as a 
function of x, and find the domain of this function. 
The box we get will have height x and rectangular base of dimensions a— 2x by b— 22. 
Thus, 
V = f(x) = x(a — 2x)(b— 22). 


Here a and 0 are constants, and V is the variable that depends on 2, i.e., V is playing the 
role of y. 

This formula makes mathematical sense for any x, but in the story problem the domain 
is much less. In the first place, « must be positive. In the second place, it must be less 
than half the length of either of the sides of the cardboard. Thus, the domain is 


1 
tore Re WO <er 3 (minimum of a and b)}. 
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In interval notation we write: the domain is the interval (0, min(a, b)/2). (You might think 
about whether we could allow 0 or min(a,b)/2 to be in the domain. They make a certain 
physical sense, though we normally would not call the result a box. If we were to allow 


these values, what would the corresponding volumes be? Does that make sense?) 


EXAMPLE 1.3.2 Circle of radius r centered at the origin The equation for 
this circle is usually given in the form x? + y? = r?. To write the equation in the form 
y = f(x) we solve for y, obtaining y = +Vr? — x2. But this is not a function, because 
when we substitute a value in (—r,r) for x there are two corresponding values of y. To get 
a function, we must choose one of the two signs in front of the square root. If we choose 
the positive sign, for example, we get the upper semicircle y = f(x) = Vr? — x? (see 
figure 1.3.2). The domain of this function is the interval |—r,r], i-e., must be between —r 


2 


and r (including the endpoints). If x is outside of that interval, then r? — x? is negative, 


and we cannot take the square root. In terms of the graph, this just means that there are 


no points on the curve whose x-coordinate is greater than r or less than —r. 


—r r 


Figure 1.3.2 Upper semicircle y = Vr? — x? 


EXAMPLE 1.3.3 Find the domain of 


1 


ae! a reer 


To answer this question, we must rule out the x-values that make 42 — x? negative (because 
we cannot take the square root of a negative number) and also the x-values that make 
4x — x? zero (because if 42 — x? = 0, then when we take the square root we get 0, and 
we cannot divide by 0). In other words, the domain consists of all 2 for which 4x — x? is 
strictly positive. We give two different methods to find out when 4a” — x? > 0. 

First method. Factor 42 — x? as x(4 — x). The product of two numbers is positive 


when either both are positive or both are negative, i.e., if either x > 0 and 4— 2 > 0, 
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or else x < 0 and 4—2 < 0. The latter alternative is impossible, since if x is negative, 
then 4 — x is greater than 4, and so cannot be negative. As for the first alternative, the 
condition 4 — x > 0 can be rewritten (adding x to both sides) as 4 > x, so we need: x > 0 
and 4 > x (this is sometimes combined in the form 4 > x > 0, or, equivalently, 0 < x < 4). 
In interval notation, this says that the domain is the interval (0, 4). 

Second method. Write 4x — 2? as —(x? — 4x), and then complete the square, 


obtaining -((e — 2)? — 4) = 4— (x — 2)". For this to be positive we need (x — 2)? < 4, 
which means that « — 2 must be less than 2 and greater than —2: —2 < r—2 < 2. Adding 
2 to everything gives 0 < x < 4. Both of these methods are equally correct; you may use 


either in a problem of this type. 


A function does not always have to be given by a single formula, as we have already 
seen (in the income tax problem, for example). Suppose that y = v(t) is the velocity 
function for a car which starts out from rest (zero velocity) at time t = 0; then increases 
its speed steadily to 20 m/sec, taking 10 seconds to do this; then travels at constant speed 
20 m/sec for 15 seconds; and finally applies the brakes to decrease speed steadily to 0, 
taking 5 seconds to do this. The formula for y = v(t) is different in each of the three time 
intervals: first y = 2x”, then y = 20, then y = —4a + 120. The graph of this function is 
shown in figure 1.3.3. 


U 


20 


10 - 


0 t 
10 25 30 


Figure 1.3.3 A velocity function. 


Not all functions are given by formulas at all. A function can be given by an ex- 
perimentally determined table of values, or by a description other than a formula. For 
example, the population y of the U.S. is a function of the time t: we can write y = f(t). 
This is a perfectly good function—we could graph it (up to the present) if we had data for 
various t—but we can’t find an algebraic formula for it. 
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Exercises 1.3. 


Find the domain of each of the following functions: 


1. 
. y= fle) =1/(2+1) => 
 ¥=f@)=1/e -1l)= 


Ly =f) = Ve 


Se oe 
- © 


= 
bo 


13. 


14. 


15. 


16. 


CaoONanp wD 


y = f(x) =V2x4-3> 


y=f(2)=Vr=> 
y=f(®)=Vz=> 
y = f(a) = Vr? — (a — h)? , where r is a positive constant. > 


-y=f(e)=V1-(1/2) => 
yea flea ly l= Be? = 
-y=f(t)=Ve+l/(x-1)=> 
-y=f(t)=1/(ve-1l => 


. Find the domain of h(a) = { iy —9)/(x—3) 2438 sie 


ifa=3. 

Suppose f(x) = 32 — 9 and g(x) = Vz. What is the domain of the composition (go f)(x)? 
(Recall that composition is defined as (go f)(x) = g(f(x)).) What is the domain of 
(fog)(x)? > 

A farmer wants to build a fence along a river. He has 500 feet of fencing and wants to enclose 
a rectangular pen on three sides (with the river providing the fourth side). If x is the length 
of the side perpendicular to the river, determine the area of the pen as a function of x. What 
is the domain of this function? > 


A can in the shape of a cylinder is to be made with a total of 100 square centimeters of 
material in the side, top, and bottom; the manufacturer wants the can to hold the maximum 
possible volume. Write the volume as a function of the radius r of the can; find the domain 
of the function. > 


A can in the shape of a cylinder is to be made to hold a volume of one liter (1000 cubic 
centimeters). The manufacturer wants to use the least possible material for the can. Write 
the surface area of the can (total of the top, bottom, and side) as a function of the radius r 
of the can; find the domain of the function. = 


1.4 SHIFTS AND DILATIONS 


Many functions in applications are built up from simple functions by inserting constants 


in various places. It is important to understand the effect such constants have on the 


appearance of the graph. 


Horizontal shifts. [f we replace x by x —C everywhere it occurs in the formula for f(x), 


then the graph shifts over C to the right. (If C is negative, then this means that the graph 


shifts over |C| to the left.) For example, the graph of y = (2—2)? is the x?-parabola shifted 


over to have its vertex at the point 2 on the z-axis. The graph of y = (x+ 1)? is the same 
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parabola shifted over to the left so as to have its vertex at —1 on the x-axis. Note well: 
when replacing x by « — C’ we must pay attention to meaning, not merely appearance. 
Starting with y = x? and literally replacing x by x — 2 gives y = x — 2?. This is y = x —4, 
a line with slope 1, not a shifted parabola. 

Vertical shifts. If we replace y by y — D, then the graph moves up D units. (If D is 
negative, then this means that the graph moves down |D| units.) If the formula is written 
in the form y = f(x) and if y is replaced by y— D to get y— D = f(x), we can equivalently 
move D to the other side of the equation and write y = f(z) + D. Thus, this principle 
can be stated: to get the graph of y = f(x) + D, take the graph of y = f(x) and move it 
D units up. For example, the function y = 2? — 4x = (x — 2)? — 4 can be obtained from 
y = (x — 2)? (see the last paragraph) by moving the graph 4 units down. The result is the 
x?-parabola shifted 2 units to the right and 4 units down so as to have its vertex at the 
point (2, —4). 

Warning. Do not confuse f(z)+D and f(x+D). For example, if f(z) is the function 2”, 
then f(x) +2 is the function x? + 2, while f(z +2) is the function (2 + 2)? = 2?7+42+4+4. 


EXAMPLE 1.4.1 Circles An important example of the above two principles starts 
with the circle 2? + y? = r?. This is the circle of radius r centered at the origin. (As we 
saw, this is not a single function y = f(x), but rather two functions y = +Vr? — x? put 
together; in any case, the two shifting principles apply to equations like this one that are 
not in the form y = f(zx).) If we replace x by x — C and replace y by y — D—getting the 
equation (x — C)? + (y — D)? = r?—the effect on the circle is to move it C to the right 
and D up, thereby obtaining the circle of radius r centered at the point (C, D). This tells 


us how to write the equation of any circle, not necessarily centered at the origin. 


We will later want to use two more principles concerning the effects of constants on 
the appearance of the graph of a function. 


Horizontal dilation. Jf x is replaced by x/A in a formula and A > 1, then the effect on 
the graph is to expand it by a factor of A in the x-direction (away from the y-axis). If A 
is between 0 and 1 then the effect on the graph is to contract by a factor of 1/A (towards 
the y-axis). We use the word “dilate” to mean expand or contract. 

For example, replacing x by x/0.5 = x/(1/2) = 2z has the effect of contracting toward 
the y-axis by a factor of 2. If A is negative, we dilate by a factor of |A| and then flip 
about the y-axis. Thus, replacing x by —2 has the effect of taking the mirror image of the 
graph with respect to the y-axis. For example, the function y = /—z, which has domain 
{x € R| a < 0}, is obtained by taking the graph of \/z and flipping it around the y-axis 
into the second quadrant. 
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Vertical dilation. If y is replaced by y/B in a formula and B > 0, then the effect on 
the graph is to dilate it by a factor of B in the vertical direction. As before, this is an 
expansion or contraction depending on whether B is larger or smaller than one. Note that 
if we have a function y = f(x), replacing y by y/B is equivalent to multiplying the function 
on the right by B: y = Bf (zx). The effect on the graph is to expand the picture away from 
the x-axis by a factor of B if B > 1, to contract it toward the z-axis by a factor of 1/B if 
0 < B <1, and to dilate by |B] and then flip about the z-axis if B is negative. 


EXAMPLE 1.4.2. Ellipses A basic example of the two expansion principles is given 
by an ellipse of semimajor axis a and semiminor axis b. We get such an ellipse by 
starting with the unit circle—the circle of radius 1 centered at the origin, the equation 
of which is x? + y? = 1—and dilating by a factor of a horizontally and by a factor of b 
vertically. To get the equation of the resulting ellipse, which crosses the x-axis at +a and 
crosses the y-axis at +b, we replace x by x/a and y by y/b in the equation for the unit 
circle. This gives 


2 2 
(=) +(¢) —]| or Oi oa 


Finally, if we want to analyze a function that involves both shifts and dilations, it 
is usually simplest to work with the dilations first, and then the shifts. For instance, if 
we want to dilate a function by a factor of A in the x-direction and then shift C' to the 
right, we do this by replacing x first by x/A and then by (a — C) in the formula. As an 
example, suppose that, after dilating our unit circle by a in the x-direction and by b in the 
y-direction to get the ellipse in the last paragraph, we then wanted to shift it a distance 
h to the right and a distance k upward, so as to be centered at the point (h,k). The new 


a y—k\? 
2 i, 
Ge aw. 
Note well that this is different than first doing shifts by h and k and then dilations by a 


and b: : : 
(1) (f-)'=1 


ellipse would have equation 


See figure 1.4.1. 
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Figure 1.4.1 __Ellipses: (222 )P + Gay = 1 on the left, (s i (4 Lye = 1 on the 
right. 


Exercises 1.4. 


Starting with the graph of y = Vz, the graph of y = 1/z, and the graph of y = V/1— <x? (the 
upper unit semicircle), sketch the graph of each of the following functions: 


1. f(z) =V2r—-2 2. f(x) =—-1-1/(¢+2) 

3. f(r) =44Ve4+2 A. y= f(z) =2/(1-2) 

5. y= f(z) =—-V—-« 6. f(x) =24+ V1-—(«-1)? 

7. f(x) =—44+ J/—-(#£ — 2) 8. f(x) = 2/1 — (2/3)? 

9. f(x) =1/(@+4+1) 10. f(x) =44+2V/1-—(x£—5)?/9 
11. f(x) =1+4+1/(@—-1) 12. f(x) =./100— 25(@ — 1)? +2 
The graph of f(x) is shown below. Sketch the graphs of the following functions. 
13. y = f(z-1) Fe: 

14. y=14 f(x+2) 

15. y=1+4 2f(c) Is 

16. y = 2f (32) 

17. y = 2f(3(a —2)) +1 2 

18. y = (1/2) f (32 — 3) 1 

19. y=f(1+2/3)+2 
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Instantaneous Rate of Change: 
The Derivative 


2.1 THE SLOPE OF A FUNCTION 


Suppose that y is a function of x, say y = f(a). It is often necessary to know how sensitive 


the value of y is to small changes in 2. 


EXAMPLE 2.1.1 Take, for example, y = f(x) = \/625 — x? (the upper semicircle of 
radius 25 centered at the origin). When x = 7, we find that y = 625 — 49 = 24. Suppose 
we want to know how much y changes when «x increases a little, say to 7.1 or 7.01. 

In the case of a straight line y = ma-+b, the slope m = Ay/Az measures the change in 
y per unit change in x. This can be interpreted as a measure of “sensitivity”; for example, 
if y = 1002 + 5, a small change in x corresponds to a change one hundred times as large 
in y, so y is quite sensitive to changes in x. 

Let us look at the same ratio Ay/Az for our function y = f(x) = 625 — x? when x 
changes from 7 to 7.1. Here Ax = 7.1 — 7 = 0.1 is the change in x, and 


Ay = f(z + Az) — f(x) = f(7.1) — f(7) 
= \/625 — 7.12 — \/625 — 72 = 23.9706 — 24 = —0.0294. 


Thus, Ay/Ax = —0.0294/0.1 = —0.294. This means that y changes by less than one 
third the change in x, so apparently y is not very sensitive to changes in x at x = 7. 
We say “apparently” here because we don’t really know what happens between 7 and 7.1. 
Perhaps y changes dramatically as x runs through the values from 7 to 7.1, but at 7.1 y 
just happens to be close to its value at 7. This is not in fact the case for this particular 


function, but we don’t yet know why. 
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One way to interpret the above calculation is by reference to a line. We have computed 
the slope of the line through (7,24) and (7.1, 23.9706), called a chord of the circle. In 
general, if we draw the chord from the point (7,24) to a nearby point on the semicircle 
(7+ Ax, f(7 + Azx)), the slope of this chord is the so-called difference quotient 


f(7+Az) — f(7) _ /625 — (7+ Az)? — 24 
Ax a Ax ; 


slope of chord = 


For example, if x changes only from 7 to 7.01, then the difference quotient (slope of the 
chord) is approximately equal to (23.997081 — 24)/0.01 = —0.2919. This is slightly less 
steep than the chord from (7, 24) to (7.1, 23.9706). 

As the second value 7 + Az moves in towards 7, the chord joining (7, f(7)) to (7 + 
Az, f(7+Az)) shifts slightly. As indicated in figure 2.1.1, as Ax gets smaller and smaller, 
the chord joining (7,24) to (7+ Az, f(7+ Az)) gets closer and closer to the tangent line 
to the circle at the point (7,24). (Recall that the tangent line is the line that just grazes 
the circle at that point, i.e., it doesn’t meet the circle at any second point.) Thus, as Ax 
gets smaller and smaller, the slope Ay/Az of the chord gets closer and closer to the slope 
of the tangent line. This is actually quite difficult to see when Az is small, because of the 
scale of the graph. The values of Ax used for the figure are 1, 5, 10 and 15, not really very 
small values. The tangent line is the one that is uppermost at the right hand endpoint. 


5 10 15 20 25 


Figure 2.1.1 Chords approximating the tangent line. (AP) 


So far we have found the slopes of two chords that should be close to the slope of 
the tangent line, but what is the slope of the tangent line exactly? Since the tangent line 
touches the circle at just one point, we will never be able to calculate its slope directly, 
using two “known” points on the line. What we need is a way to capture what happens 
to the slopes of the chords as they get “closer and closer” to the tangent line. 
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Instead of looking at more particular values of Az, let’s see what happens if we do 
some algebra with the difference quotient using just Az. The slope of a chord from (7, 24) 
to a nearby point is given by 


625 — (7+ Az)? —24 — \/625 — (7+ Az)? — 24 /625 — (7 + Ax)? + 24 
Aa - Ax 625 — (7 + Az)? + 24 
625 — (7 + Ax)? — 24? 
~ Aa(./625 — (7 + Ax)? + 24) 
— 49-49 — 14Aaz — Ax? 
~ Aa(./625 — (7 + Ax)? + 24) 
Az(—14 — Az) 
Ax(,/625 — (7 + Az)? + 24) 
_ —14— Ax 
6/625 — (7 + Ax)? +24 


Now, can we tell by looking at this last formula what happens when Az gets very close to 
zero? The numerator clearly gets very close to —14 while the denominator gets very close to 
V 625 — 72424 = 48. Is the fraction therefore very close to —14/48 = —7/24 = —0.29167? 
It certainly seems reasonable, and in fact it is true: as Ax gets closer and closer to zero, 
the difference quotient does in fact get closer and closer to —7/24, and so the slope of the 
tangent line is exactly —7/24. 

What about the slope of the tangent line at 7 = 12? Well, 12 can’t be all that different 
from 7; we just have to redo the calculation with 12 instead of 7. This won’t be hard, but 
it will be a bit tedious. What if we try to do all the algebra without using a specific value 
for x? Let’s copy from above, replacing 7 by x. We’ll have to do a bit more than that—for 
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example, the “24” in the calculation came from \/ 625 — 72, so we’ll need to fix that too. 
1/625 — (a + Az)? — V625— 2? | 
Ae 
_ 625 — (a + Az)? — 625 — a? ./625 — (x + Ax)? + 625 — 2? 
7 Ax (625 — (4 + Ax)? + 625 — 2? 
_ 625 — (x# + Az)? — 625+ 2? 
Aa (./625 — (a + Ax)? + 625 — 2?) 
— 625 — g? = 2eAc = Ag? ~— 625427 
Aa (\/625 — (x + Ax)? + V625 — x?) 
7 Az(—2z — Az) 
Aa (\/625 — (@ + Ax)? + V625 — x?) 
_ —2x — Ax 
4/625 — (a + Az)? + 625 — «2? 


Now what happens when Az is very close to zero? Again it seems apparent that the 


quotient will be very close to 
—22 _ —25 _ =f 
V625 — #2 + /625—22 2,/625—22 V625—22 


Replacing x by 7 gives —7/24, as before, and now we can easily do the computation for 12 


or any other value of x between —25 and 25. 

So now we have a single, simple formula, —x/1/ 625 — «?, that tells us the slope of the 
tangent line for any value of x. This slope, in turn, tells us how sensitive the value of y is 
to changes in the value of x. 

What do we call such a formula? That is, a formula with one variable, so that substi- 
tuting an “input” value for the variable produces a new “output” value? This is a function. 
Starting with one function, /625 — x2, we have derived, by means of some slightly nasty 
algebra, a new function, —2/1/625 — x”, that gives us important information about the 
original function. This new function in fact is called the derivative of the original func- 
tion. If the original is referred to as f or y then the derivative is often written f’ or y’ and 
pronounced “f prime” or “y prime”, so in this case we might write f’(2) = —xr/./625 — 2. 
At a particular point, say x = 7, we say that f’(7) = —7/24 or “f prime of 7 is —7/24” or 
“the derivative of f at 7 is —7/24.” 

To summarize, we compute the derivative of f(x) by forming the difference quotient 

flat Ae) ~ f(a) 
Agr ; 
which is the slope of a line, then we figure out what happens when Az gets very close to 
0. 


(2.1.1) 
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We should note that in the particular case of a circle, there’s a simple way to find the 
derivative. Since the tangent to a circle at a point is perpendicular to the radius drawn 
to the point of contact, its slope is the negative reciprocal of the slope of the radius. The 
radius joining (0,0) to (7,24) has slope 24/7. Hence, the tangent line has slope —7/24. In 


general, a radius to the point (2, \/ 625 — x?) has slope \/625 — 2?/a, so the slope of the 
tangent line is —x/./625 — x?, as before. It is NOT always true that a tangent line is 
perpendicular to a line from the origin—don’t use this shortcut in any other circumstance. 

As above, and as you might expect, for different values of x we generally get different 
values of the derivative f’(x). Could it be that the derivative always has the same value? 
This would mean that the slope of f, or the slope of its tangent line, is the same everywhere. 
One curve that always has the same slope is a line; it seems odd to talk about the tangent 
line to a line, but if it makes sense at all the tangent line must be the line itself. It is not 
hard to see that the derivative of f(x) = mx + bis f’(x) =m; see exercise 6. 


Exercises 2.1. 


1. Draw the graph of the function y = f(x) = 169 — x? between x = 0 and x = 13. Find the 
slope Ay/Az of the chord between the points of the circle lying over (a) x = 12 and x = 13, 
(b) « = 12 and « = 12.1, (c) x = 12 and x = 12.01, (d) e = 12 and x = 12.001. Now use 
the geometry of tangent lines on a circle to find (e) the exact value of the derivative f’(12). 
Your answers to (a)—(d) should be getting closer and closer to your answer to (e). => 


2. Use geometry to find the derivative f’(x) of the function f(x) = \/625 — x? in the text for 
each of the following x: (a) 20, (b) 24, (c) —7, (d) —15. Draw a graph of the upper semicircle, 
and draw the tangent line at each of these four points. => 


3. Draw the graph of the function y = f(x) = 1/x between x = 1/2 and x = 4. Find the slope 
of the chord between (a) x = 3 and x = 3.1, (b) x =3 and x = 3.01, (c) x =3 and z = 3.001. 
Now use algebra to find a simple formula for the slope of the chord between (3, f(3)) and 
(3 + Ag, f(3 + Ax)). Determine what happens when Az approaches 0. In your graph of 
y = 1/2, draw the straight line through the point (3, 1/3) whose slope is this limiting value 
of the difference quotient as Ax approaches 0. => 


A. Find an algebraic expression for the difference quotient ( f+Acx)—f (1)) /Azx when f(x) = 


x” — (1/a). Simplify the expression as much as possible. Then determine what happens as 
Ax approaches 0. That value is f’(1). > 


5. Draw the graph of y = f(x) = x between « = 0 and x = 1.5. Find the slope of the chord 
between (a) x = 1 and g = 1.1, (b) = 1 and x = 1.001, (c) g = 1 and x = 1.00001. 
Then use algebra to find a simple formula for the slope of the chord between 1 and 1+ Az. 
(Use the expansion (A + B)* = A® + 3A°B+3AB* + B®.) Determine what happens as Ax 
approaches 0, and in your graph of y = x” draw the straight line through the point (1,1) 
whose slope is equal to the value you just found. > 


6. Find an algebraic expression for the difference quotient (f(a + Aa) — f(x))/Azx when f(x) = 
ma + b. Simplify the expression as much as possible. Then determine what happens as Ax 
approaches 0. That value is f’(x). > 
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7. Sketch the unit circle. Discuss the behavior of the slope of the tangent line at various angles 
around the circle. Which trigonometric function gives the slope of the tangent line at an 
angle 0? Why? Hint: think in terms of ratios of sides of triangles. 


8. Sketch the parabola y = x”. For what values of x on the parabola is the slope of the tangent 
line positive? Negative? What do you notice about the graph at the point(s) where the sign 
of the slope changes from positive to negative and vice versa? 


2.2 AN EXAMPLE 


We started the last section by saying, “It is often necessary to know how sensitive the 
value of y is to small changes in x.” We have seen one purely mathematical example of 
this: finding the “steepness” of a curve at a point is precisely this problem. Here is a more 
applied example. 

With careful measurement it might be possible to discover that a dropped ball has 
height h(t) = ho—kt?, t seconds after it is released. (Here ho is the initial height of the ball, 
when t = 0, and k is some number determined by the experiment.) A natural question is 
then, “How fast is the ball going at time t?” We can certainly get a pretty good idea with a 
little simple arithmetic. To make the calculation more concrete, let’s say hy = 100 meters 
and k = 4.9 and suppose we’re interested in the speed at t = 2. We know that when t = 2 
the height is 100 —4-4.9 = 80.4. A second later, at t = 3, the height is 100 —9-4.9 = 55.9, 
so in that second the ball has traveled 80.4 — 55.9 = 24.5 meters. This means that the 
average speed during that time was 24.5 meters per second. So we might guess that 24.5 
meters per second is not a terrible estimate of the speed at t = 2. But certainly we can 
do better. At ¢ = 2.5 the height is 100 — 4.9(2.5)? = 69.375. During the half second from 
t = 2 to t = 2.5 the ball dropped 80.4 — 69.375 = 11.025 meters, at an average speed of 
11.025/(1/2) = 22.05 meters per second; this should be a better estimate of the speed at 
t = 2. So it’s clear now how to get better and better approximations: compute average 
speeds over shorter and shorter time intervals. Between t = 2 and t = 2.01, for example, 
the ball drops 0.19649 meters in one hundredth of a second, at an average speed of 19.649 
meters per second. 

We can’t do this forever, and we still might reasonably ask what the actual speed 
precisely at t = 2 is. If At is some tiny amount of time, what we want to know is what 
happens to the average speed (h(2) — h(2+ At))/At as At gets smaller and smaller. Doing 
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a bit of algebra: 


h(2)—h(2+ At) 80.4 — (100 — 4.9(2 + At)?) 
At ~ At 
80.4 — 100 + 19.6 + 19.6At + 4.9At? 
7 At 
19.6At + 4.9At? 
At 
19.6 + 4.9At 


When At is very small, this is very close to 19.6, and indeed it seems clear that as At 
goes to zero, the average speed goes to 19.6, so the exact speed at t = 2 is 19.6 meters per 
second. This calculation should look very familiar. In the language of the previous section, 
we might have started with f(x) = 100 — 4.92” and asked for the slope of the tangent line 
at 2 = 2. We would have answered that question by computing 


f(2+ Ax) — f(2) _ -19.6Aa — 4.9Az? 


= —19.6—4.9A 
Te Az 9.6 9Ax 


The algebra is the same, except that following the pattern of the previous section the 
subtraction would be reversed, and we would say that the slope of the tangent line is 
—19.6. Indeed, in hindsight, perhaps we should have subtracted the other way even for 
the dropping ball. At t = 2 the height is 80.4; one second later the height is 55.9. The 
usual way to compute a “distance traveled” is to subtract the earlier position from the 
later one, or 55.9 — 80.4 = —24.5. This tells us that the distance traveled is 24.5 meters, 
and the negative sign tells us that the height went down during the second. If we continue 
the original calculation we then get —19.6 meters per second as the exact speed at t = 2. 
If we interpret the negative sign as meaning that the motion is downward, which seems 
reasonable, then in fact this is the same answer as before, but with even more information, 
since the numerical answer contains the direction of motion as well as the speed. Thus, 
the speed of the ball is the value of the derivative of a certain function, namely, of the 
function that gives the position of the ball. (More properly, this is the velocity of the ball; 
velocity is signed speed, that is, speed with a direction indicated by the sign.) 

The upshot is that this problem, finding the speed of the ball, is exactly the same 
problem mathematically as finding the slope of a curve. This may already be enough 
evidence to convince you that whenever some quantity is changing (the height of a curve 
or the height of a ball or the size of the economy or the distance of a space probe from 
earth or the population of the world) the rate at which the quantity is changing can, in 
principle, be computed in exactly the same way, by finding a derivative. 


36 Chapter 2 Instantaneous Rate of Change: The Derivative 


Exercises 2.2. 


1. An object is traveling in a straight line so that its position (that is, distance from some fixed 
point) is given by this table: 


time (seconds) 


distance (meters) 


Find the average speed of the object during the following time intervals: [0,1], [0,2], [0,3], 
[1,2], [1,3], [2,3]. If you had to guess the speed at t = 2 just on the basis of these, what 
would you guess? => 


2. Let y = f(t) = 0’, where t is the time in seconds and y is the distance in meters that an 
object falls on a certain airless planet. Draw a graph of this function between t = 0 and 
t = 3. Make a table of the average speed of the falling object between (a) 2 sec and 3 sec, 
(b) 2 sec and 2.1 sec, (c) 2 sec and 2.01 sec, (d) 2 sec and 2.001 sec. Then use algebra to find 
a simple formula for the average speed between time 2 and time 2+ At. (If you substitute 
At = 1, 0.1, 0.01, 0.001 in this formula you should again get the answers to parts (a)—(d).) 
Next, in your formula for average speed (which should be in simplified form) determine what 
happens as At approaches zero. This is the instantaneous speed. Finally, in your graph 
of y = t’ draw the straight line through the point (2,4) whose slope is the instantaneous 
velocity you just computed; it should of course be the tangent line. > 


3. If an object is dropped from an 80-meter high window, its height y above the ground at time 
t seconds is given by the formula y = f(t) = 80— 4.9t?. (Here we are neglecting air resistance; 
the graph of this function was shown in figure 1.0.1.) Find the average velocity of the falling 
object between (a) 1 sec and 1.1 sec, (b) 1 sec and 1.01 sec, (c) 1 sec and 1.001 sec. Now use 
algebra to find a simple formula for the average velocity of the falling object between 1 sec 
and 1+ At sec. Determine what happens to this average velocity as At approaches 0. That 
is the instantaneous velocity at time t = 1 second (it will be negative, because the object is 
falling). = 


2.3 LIMITS 


In the previous two sections we computed some quantities of interest (slope, velocity) by 
seeing that some expression “goes to” or “approaches” or “gets really close to” a particular 
value. In the examples we saw, this idea may have been clear enough, but it is too fuzzy 
to rely on in more difficult circumstances. In this section we will see how to make the idea 
more precise. 

There is an important feature of the examples we have seen. Consider again the 


formula 


—19.6Ax — 4.9Ax? 
Ar , 


We wanted to know what happens to this fraction as “Az goes to zero.” Because we were 


able to simplify the fraction, it was easy to see the answer, but it was not quite as simple 
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as “substituting zero for Az,” as that would give 


—19.6-0—4.9-0 
0 y) 


which is meaningless. The quantity we are really interested in does not make sense “at 
zero,” and this is why the answer to the original problem (finding a velocity or a slope) 
was not immediately obvious. In other words, we are generally going to want to figure 
out what a quantity “approaches” in situations where we can’t merely plug in a value. If 
you would like to think about a hard example (which we will analyze later) consider what 
happens to (sinz)/x as x approaches zero. 


EXAMPLE 2.3.1 Does \/z approach 1.41 as x approaches 2? In this case it is possible 
to compute the actual value V2 to a high precision to answer the question. But since 
in general we won’t be able to do that, let’s not. We might start by computing \/x for 
values of x close to 2, as we did in the previous sections. Here are some values: 2.05 = 
1.431782106, V2.04 = 1.428285686, V2.03 = 1.424780685, V2.02 = 1.421267040, V2.01 = 
1.417744688, V2.005 = 1.415980226, V2.004 = 1.415627070, V2.003 = 1.415273825, 
V2.002 = 1.414920492, 2.001 = 1.414567072. So it looks at least possible that indeed 
these values “approach” 1.41—already V2.001 is quite close. If we continue this process, 
however, at some point we will appear to “stall.” In fact, V2 = 1.414213562..., so we will 


never even get as far as 1.4142, no matter how long we continue the sequence. 


So in a fuzzy, everyday sort of sense, it is true that x “gets close to” 1.41, but it 
does not “approach” 1.41 in the sense we want. 'To compute an exact slope or an exact 
velocity, what we want to know is that a given quantity becomes “arbitrarily close” to a 
fixed value, meaning that the first quantity can be made “as close as we like” to the fixed 
value. Consider again the quantities 


—19.6Ar — 4.9Ax? 


= —19.6—4.9Az. 
Ax = 


These two quantities are equal as long as Az is not zero; if Az is zero, the left hand 
quantity is meaningless, while the right hand one is —19.6. Can we say more than we 
did before about why the right hand side “approaches” —19.6, in the desired sense? Can 
we really make it “as close as we want” to —19.6? Let’s try a test case. Can we make 
—19.6 — 4.9Azx within one millionth (0.000001) of —19.6? The values within a millionth of 
—19.6 are those in the interval (—19.600001, —19.599999). As Az approaches zero, does 
—19.6 — 4.9Az eventually reside inside this interval? If Ax is positive, this would require 
that —19.6 — 4.9Ax > —19.600001. This is something we can manipulate with a little 
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algebra: 
—19.6 — 4.9Ax > —19.600001 


—4.9Azxz > —0.000001 
Az < —0.000001/ — 4.9 
Ax < 0.0000002040816327... 


Thus, we can say with certainty that if Az is positive and less than 0.0000002, then 
Az < 0.0000002040816327 ... and so —19.6 —4.9Axz > —19.600001. We could do a similar 
calculation if Az is negative. 

So now we know that we can make —19.6 — 4.9Az within one millionth of —19.6. But 
can we make it “as close as we want”? In this case, it is quite simple to see that the answer 
is yes, by modifying the calculation we’ve just done. It may be helpful to think of this as 
a game. I claim that I can make —19.6 — 4.9Az as close as you desire to —19.6 by making 
Ax “close enough” to zero. So the game is: you give me a number, like 10~°, and I have 
to come up with a number representing how close Az must be to zero to guarantee that 
—19.6 —4.9Az is at least as close to —19.6 as you have requested. 

Now if we actually play this game, I could redo the calculation above for each new 
number you provide. What I’d like to do is somehow see that I will always succeed, and 
even more, I’d like to have a simple strategy so that I don’t have to do all that algebra 
every time. A strategy in this case would be a formula that gives me a correct answer no 
matter what you specify. So suppose the number you give me is «. How close does Ax 
have to be to zero to guarantee that —19.6 — 4.9Az is in (—19.6 — €,-19.6+ €)? If Az is 
positive, we need: 

—19.6 —4.9Ax > —19.6 —€ 


—4,9Azr > -€ 
Ag < —e/ — 4.9 
Az < €/4.9 


So if I pick any number 6 that is less than €/4.9, the algebra tells me that whenever Az < 6 
then Az < €/4.9 and so —19.6 — 4.9Az is within € of —19.6. (This is exactly what I did 
in the example: I picked 6 = 0.0000002 < 0.0000002040816327....) A similar calculation 
again works for negative Ax. The important fact is that this is now a completely general 
result—it shows that I can always win, no matter what “move” you make. 

Now we can codify this by giving a precise definition to replace the fuzzy, “gets closer 
and closer” language we have used so far. Henceforward, we will say something like “the 
limit of (-19.6Axr—4.9Ax?)/Aa as Ax goes to zero is —19.6,” and abbreviate this mouthful 
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as 
~19.6Axr — 4.9Ax2 
pg ee Ee ge 
Az—>0 Az 


Here is the actual, official definition of “limit”. 


DEFINITION 2.3.2 Limit Suppose f is a function. We say that lim (6). = it 


for every € > 0 there is a 6 > 0 so that whenever 0 < |x —a| <0, |f(z) —L| <e. 


The ¢€ and 6 here play exactly the role they did in the preceding discussion. The 
definition says, in a very precise way, that f(z) can be made as close as desired to L 
(that’s the |f(x) — L| < e€ part) by making z close enough to a (the 0 < |x —a| < 6 
part). Note that we specifically make no mention of what must happen if 2 = a, that is, 
if | — a| = 0. This is because in the cases we are most interested in, substituting a for x 
doesn’t even make sense. 

Make sure you are not confused by the names of important quantities. The generic 
definition talks about f(x), but the function and the variable might have other names. In 
the discussion above, the function we analyzed was 


—19.6Ax — 4.9Ax? 
Ax . 


and the variable of the limit was not x but Az. The x was the variable of the original 
function; when we were trying to compute a slope or a velocity, x was essentially a fixed 
quantity, telling us at what point we wanted the slope. (In the velocity problem, it was 
literally a fixed quantity, as we focused on the time 2.) The quantity a of the definition 
in all the examples was zero: we were always interested in what happened as Ax became 
very close to zero. 

Armed with a precise definition, we can now prove that certain quantities behave in a 
particular way. The bad news is that even proofs for simple quantities can be quite tedious 
and complicated; the good news is that we rarely need to do such proofs, because most 
expressions act the way you would expect, and this can be proved once and for all. 


EXAMPLE 2.3.3 Let’s show carefully that lim a +4=6. This is not something we 
H oa 


“need” to prove, since it is “obviously” true. But if we couldn’t prove it using our official 
definition there would be something very wrong with the definition. 

As is often the case in mathematical proofs, it helps to work backwards. We want to 
end up showing that under certain circumstances x + 4 is close to 6; precisely, we want to 
show that |a + 4 — 6] < ¢, or |x — 2| < «. Under what circumstances? We want this to be 
true whenever 0 < |x — 2| < 6. So the question becomes: can we choose a value for 6 that 
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guarantees that 0 < |x — 2| < 6 implies |x — 2| < €? Of course: no matter what € is, 6 = € 


works. 


y) 


So it turns out to be very easy to prove something “obvious,” which is nice. It doesn’t 


take long before things get trickier, however. 


EXAMPLE 2.3.4 _ It seems clear that lim x? = 4. Let’s try to prove it. We will want 


x2 
to be able to show that |x? — 4| < € whenever 0 < |x — 2| < 6, by choosing 6 carefully. Is 
there any connection between |a — 2| and |x? — 4|? Yes, and it’s not hard to spot, but it is 
not so simple as the previous example. We can write |x? — 4| = |(a+2)(a —2)|. Now when 
|x —2| is small, part of |(~+2)(a—2)| is small, namely (a —2). What about (+2)? If z is 
close to 2, (x + 2) certainly can’t be too big, but we need to somehow be precise about it. 
Let’s recall the “game” version of what is going on here. You get to pick an € and I have to 
pick a 6 that makes things work out. Presumably it is the really tiny values of € I need to 
worry about, but I have to be prepared for anything, even an apparently “bad” move like 
€ = 1000. I expect that € is going to be small, and that the corresponding 6 will be small, 
certainly less than 1. If 6 < 1 then |x + 2] < 5 when |x — 2| < 6 (because if x is within 1 of 
2, then x is between 1 and 3 and x + 2 is between 3 and 5). So then I’d be trying to show 
that |(a + 2)(a — 2)| < 5\ja — 2| < «. So now how can I pick 6 so that |x — 2| < 6 implies 
5|a — 2| < €? This is easy: use 6 = €/5, so 5|x — 2| < 5(€/5) = €. But what if the « you 
choose is not small? If you choose € = 1000, should I pick 6 = 200? No, to keep things 


“sane” I will never pick a 6 bigger than 1. Here’s the final “game strategy:” 


When you 
pick a value for € I will pick 6 = €/5 or 6 = 1, whichever is smaller. Now when |x — 2| < 6, 
I know both that |# + 2] < 5 and that |a — 2| < €/5. Thus |(x + 2)(a — 2)| < 5(€/5) =e. 

This has been a long discussion, but most of it was explanation and scratch work. If 
this were written down as a proof, it would be quite short, like this: 


Proof that lim x? = 4. Given any e€, pick 6 = €/5 or 6 = 1, whichever is smaller. Then 
«2 


when |x —2| < 6, |e+2| <5 and |x—2| < €/5. Hence |x” — 4| = |(x +2)(x —2)| < 5(€/5) = 
€. 


It probably seems obvious that lim x? = 4, and it is worth examining more closely 
xr 


why it seems obvious. If we write 2? = x- x, and ask what happens when x approaches 2, 
we might say something like, “Well, the first x approaches 2, and the second x approaches 
2, so the product must approach 2-2.” In fact this is pretty much right on the money, 
except for that word “must.” Is it really true that if x approaches a and y approaches b 
then xy approaches ab? It is, but it is not really obvious, since x and y might be quite 
complicated. The good news is that we can see that this is true once and for all, and then 


2.3 Limits Al 


we don’t have to worry about it ever again. When we say that x might be “complicated” 
we really mean that in practice it might be a function. Here is then what we want to know: 


THEOREM 2.3.5 Suppose lim f(x) =L and lim g(x) = M. Then 

lites ot (eae) = LM. 

Proof. We have to use the official definition of limit to make sense of this. So given any 
€ we need to find a 6 so that 0 < |x —a| < 6 implies |f(x)g(x) — LM| < «. What do we 
have to work with? We know that we can make f(x) close to L and g(x) close to M, and 


we have to somehow connect these facts to make f(x)g(x) close to LM. 
We use, as is so often the case, a little algebraic trick: 


|f(x)g(@) — LM| = |f(x)g(@) — f(a) M + f(x) M — LM| 
= |f(x)(g(a) — M) + (f(@) — £)M| 
< |f(x)(g(x) — M)| + |(f(#) — £)M| 
= |f(@)||9(@) — M] + |f(%) — L||M]. 
This is all straightforward except perhaps for the “<”. That is an example of the triangle 


inequality, which says that if a and b are any real numbers then |a+ 6] < |a|+ |b]. If you 
look at a few examples, using positive and negative numbers in various combinations for 
a and 6, you should quickly understand why this is true; we will not prove it formally. 

Since jim f(x) = L, there is a value 6; so that 0 < |x —a| < 6; implies |f(x) — L| < 
|e/(2M)|, This means that 0 < |x — a| < 6; implies |f(x) — L||M| < €/2. You can see 
where this is going: if we can make |f(x)||g(a) — M| < €/2 also, then we’ll be done. 

We can make |g(z) — M| smaller than any fixed number by making x close enough 
to a; unfortunately, €/(2f(a)) is not a fixed number, since x is a variable. Here we need 
another little trick, just like the one we used in analyzing x?. We can find a 62 so that 
|x — al < 62 implies that | f(a) — L| < 1, meaning that LD —1< f(x) < L+1. This means 
that |f(x)| < N, where N is either |L — 1| or |Z + 1], depending on whether L is negative 
or positive. The important point is that N doesn’t depend on x. Finally, we know that 
there is a 63 so that 0 < |x — a| < 63 implies |g(a) — M| < «/(2N). Now we're ready to 
put everything together. Let 6 be the smallest of 61, 62, and 63. Then |x — a] < 6 implies 
that | f(x) — L| < |e/(2M)|, |f(x)| < N, and |g(a) — M| < «/(2N). Then 


|f(x)g(z) -— LM| < |f(z)|lo(x) — M| + |f(x) — LI| | 
< N5— + lace | || 


ae ae 


This is just what we needed, so by the official definition, lim f(x)g(x) = LM. a 
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A handful of such theorems give us the tools to compute many limits without explicitly 


working with the definition of limit. 


THEOREM 2.3.6 Suppose that lim ft) ess dang lim g(x) = M and k is some 


constant. Then 


lim k f(z) =k lim f(a) =kL 


lim (f(z) + g(2)) = lim f(z) + lim g(x) = L+M 
lim (f(x) — g(x)) = lim f(z) — lim g(x) = L~M 


lim (f(x)g(x)) = lim f(z) - lim g(x) = LM 


() _ limssef(a) _ if M is not 0 
(x) limp sag(t)  M’ 


= 
Roughly speaking, these rules say that to compute the limit of an algebraic expression, 


it is enough to compute the limits of the “innermost bits” and then combine these limits. 


This often means that it is possible to simply plug in a value for the variable, since 


Lia ae Sas 
x-a 

_ 2? —3r4+5 
EXAMPLE 2.3.7 Compute lim -—————. If we apply the theorem in all its gory 


x1 4 Ged 2 
detail, we get 


A ee = lim, 51(x2? — 32 + 5) 
z>1 @—-2 —— limg4i(2—2) 


Cig 43 x7) — (limy_,1 32) + (limg_,1 5) 


7 (lim,_,; x) — (lim,_,; 2) 


(lim,_,; 2)? — 3(lim,_,; 2) +5 
(lim, .; 2) — 2 


ee a 
= Lo 
1345 
= ——__ = - 3 
ae) 


It is worth commenting on the trivial limit lim 5. From one point of view this might 
2 


seem meaningless, as the number 5 can’t “approach” any value, since it is simply a fixed 
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number. But 5 can, and should, be interpreted here as the function that has value 5 
everywhere, f(x) = 5, with graph a horizontal line. From this point of view it makes sense 
to ask what happens to the height of the function as x approaches 1. 

Of course, as we’ve already seen, we’re primarily interested in limits that aren’t so easy, 
namely, limits in which a denominator approaches zero. There are a handful of algebraic 
tricks that work on many of these limits. 


2? 4+ We 3 
x—l 
that makes the denominator zero. However: 


EXAMPLE 2.3.8 Compute lim, . We can’t simply plug in x = 1 because 
2 


2 = _ 
lim 2 + 2x 3 _ im &@ 1)(a + 3) 


zl x—1 xl zx—l 


= lm(#+3)=4 
x1 


While theorem 2.3.6 is very helpful, we need a bit more to work easily with limits. 
Since the theorem applies when some limits are already known, we need to know the 
behavior of some functions that cannot themselves be constructed from the simple arith- 
metic operations of the theorem, such as \/x. Also, there is one other extraordinarily 
useful way to put functions together: composition. If f(x) and g(x) are functions, we can 
form two functions by composition: f(g(x)) and g(f(x)). For example, if f(x) = x and 


g(x) = «7 +5, then f(g(x)) = V22+5 and o(f(x)) = (/z)? +5 = 245. Here is a 


companion to theorem 2.3.6 for composition: 


THEOREM 2.3.9 Suppose that lim g(x) = L and lim f(x) = f(£). Then 
wa H er 


lim f(g(«)) = f(L). 


ra 
a 


Note the special form of the condition on f: it is not enough to know that lim fo= 
It 


M, though it is a bit tricky to see why. Many of the most familiar functions do have this 
property, and this theorem can therefore be applied. For example: 


THEOREM 2.3.10 Suppose that n is a positive integer. Then 


lim Va = Va, 


ra 


provided that a is positive if n is even. r 
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This theorem is not too difficult to prove from the definition of limit. 
Another of the most common algebraic tricks was used in section 2.1. Here’s another 


example: 


Yee ee, 
EXAMPLE 2.3.11 Compute lim v212—*. 
r——1 etl 


oY He on 4 VY“xt5—-2/r+5+4+2 


lim —————— = lim 


z>-l1 g¢+1 e>-1 @+1 VYr+542 
; x+5-—4 
= lm 
x+—1 (x +1)(Vr +5 + 2) 
: xt+l1 
= lm 
x+—1 (x +1)(Vr +5 + 2) 
i 1 1 
= 1m — 
e>-lV/a+5+2 4 


At the very last step we have used theorems 2.3.9 and 2.3.10. 


Occasionally we will need a slightly modified version of the limit definition. Consider 


the function f(z) = V1 -—.2?, the upper half of the unit circle. What can we say about 
lim f(x)? It is apparent from the graph of this familiar function that as x gets close to 1 
x 


from the left, the value of f(a) gets close to zero. It does not even make sense to ask what 
happens as x approaches 1 from the right, since f(x) is not defined there. The definition 
of the limit, however, demands that f(1 + Az) be close to f(1) whether Az is positive or 
negative. Sometimes the limit of a function exists from one side or the other (or both) even 
though the limit does not exist. Since it is useful to be able to talk about this situation, 
we introduce the concept of one sided limit: 


DEFINITION 2.3.12 One-sided limit Suppose that f(x) is a function. We say 
that lim f(x) = L if for every €« > 0 there is a d > 0 so that whenever 0 < a—z < 4, 
«ta 


| f(x) — L| < «. We say that lim,_,,+ f(x) = L if for every « > 0 there is a 6 > 0 so that 
whenever 0 < sx—a < 4, |f(x) —L| <e. 


Usually lim f(x) is read “the limit of f(x) from the left” and lim f(a) is read “the 
wa” 


wat 


limit of f(z) from the right”. 


EXAMPLE 2.3.13 Discuss lim aa lim =, and lim —. 
«0 |r|’ 20- |z| «0+ |2| 
The function f(x) = x/|z| is undefined at 0; when x > 0, |z| = x and so f(x) = 1; 
when x < 0, |z| = —z and f(z) = —1. Thus lim “= lim -1=—1while im “~ = 
a0- |z| 2—0- «0+ || 
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lim 1 = 1. The limit of f(x) must be equal to both the left and right limits; since they 


x—O0F 


are different, the limit lim i does not exist. 


«wr Fee 


Exercises 2.3. 


Compute the limits. If a limit does not exist, explain why. 


1. 


3. 


17. 


. a+ar—12 ap ge 19 
lim. ————_—_ => 2. lim ————_ 
x23 4 je 3 x21 cC— 3 

2 _ Ooch a 

a? oo A iy ES 

234 Go = 3 x22 4 be 2 
/ _ 1 1 

fin eS 6. lim 4/—+2 < 
a1 x—l1 ez—3o0t x x 

lim 3 > 8. lim 3a°—52 > 
x2 Za 

— 5 p2 2 

] ae 10. lim = 

xz—0 C= 1 zl eT 
=e =e 

ie 1). dee 

z—>O0t x ze+o+t e+1 
xv a? 2 3 

lim 14. lim(#* +4)" > 

Zora GT—-a x2 


1 
: lim, x sin (+) (Hint: Use the fact that |sina| < 1 for any real number a. You should 
Ls 9 6 


probably use the definition of a limit here.) => 


Give an e—0d proof, similar to example 2.3.3, of the fact that lim (2a —5)=3. 
x 
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18. Evaluate the expressions by reference to this graph: 


(a) lim fe) (b) Jim, f() (c) lim f(x) 
(4) lim (2) (e) lim, f() (f) f(-2) 
(g) lim f(z) (h) lim f(e) (i) lim f(@ + 1) 
() £0) (x) lim f@-4) (Jim, f@-2) 
=> 
19. Use a calculator to estimate iim me 
tan(32) 


20. Use a calculator to estimate lim : 
«0 tan(52) 


2.4 THE DERIVATIVE FUNCTION 


We have seen how to create, or derive, a new function f’(x) from a function f(x), summa- 
rized in the paragraph containing equation 2.1.1. Now that we have the concept of limits, 


we can make this more precise. 
DEFINITION 2.4.1 The derivative of a function f, denoted f’, is 


fla+ Ag) = f(z} 
tae Az , 


2.4 The Derivative Function AT 


We know that f’ carries important information about the original function f. In one 
example we saw that f’(x) tells us how steep the graph of f(x) is; in another we saw that 
f’(«) tells us the velocity of an object if f(a) tells us the position of the object at time x. 
As we said earlier, this same mathematical idea is useful whenever f(a) represents some 
changing quantity and we want to know something about how it changes, or roughly, the 
“rate” at which it changes. Most functions encountered in practice are built up from a 
small collection of “primitive” functions in a few simple ways, for example, by adding or 
multiplying functions together to get new, more complicated functions. To make good use 
of the information provided by f’(«) we need to be able to compute it for a variety of such 
functions. 

We will begin to use different notations for the derivative of a function. While initially 
confusing, each is often useful so it is worth maintaining multiple versions of the same 
thing. 

Consider again the function f(a”) = 625 —2?. We have computed the derivative 
f'(x) = —a//625 — x2, and have already noted that if we use the alternate notation 
y = V625— 2? then we might write y’ = —x//625—22. Another notation is quite 
different, and in time it will become clear why it is often a useful one. Recall that to 
compute the the derivative of f we computed 


= 2 
ia /625 — (7 + Az) 24 
Az—0 Ax 


The denominator here measures a distance in the x direction, sometimes called the “run”, 
and the numerator measures a distance in the y direction, sometimes called the “rise,” and 
“rise over run” is the slope of a line. Recall that sometimes such a numerator is abbreviated 
Ay, exchanging brevity for a more detailed expression. So in general, a derivative is given 
by 
y = lim Ay 
Azo Ax 


To recall the form of the limit, we sometimes say instead that 


In other words, dy/dzx is another notation for the derivative, and it reminds us that it is 
related to an actual slope between two points. This notation is called Leibniz notation, 
after Gottfried Leibniz, who developed the fundamentals of calculus independently, at 
about the same time that Isaac Newton did. Again, since we often use f and f(x) to mean 
the original function, we sometimes use df /dx and df(x)/dx to refer to the derivative. If 
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the function f(x) is written out in full we often write the last of these something like this 


= oy 625 — x? 


dx 


with the function written to the side, instead of trying to fit it into the numerator. 


EXAMPLE 2.4.2 Find the derivative of y = f(t) = t?. 
We compute 


2 2 
joie a CO 
At>0 At Ato At 
ve ee EAE ae Se 
= lim 
At30 At 
2tAt + At? 
—s 1m ey 
At30 At 


= lim. 26+ At = 2. 
At0 


Remember that At is a single quantity, not a “A” times a “t”, and so Af? is (At)? not 
Ae): 


EXAMPLE 2.4.3 Find the derivative of y = f(x) = 1/z. 
The computation: 


A ee | 
! F Yy , etAn x 
yf = i eS i, 
Az>0 Ar Az>0 Az 
x _ _&+Azxz 
fei x(a+Az) x(a+Az) 
Az—->0 Av 
x—(x+Az) 
x(a+Az) 


— 
=" 


=. ie ee Az 
~ Aro x(x + Ar)Ax 


=A) —Ax 
se 30 x(x + Axv)Ax 

. —l —l 
= lim = 


— Arsvoa(at+Ar) x? 


Note. If you happen to know some “derivative formulas” from an earlier course, for 
the time being you should pretend that you do not know them. In examples like the 
ones above and the exercises below, you are required to know how to find the derivative 
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formula starting from basic principles. We will later develop some formulas so that we do 
not always need to do such computations, but we will continue to need to know how to do 
the more involved computations. 

Sometimes one encounters a point in the domain of a function y = f(x) where there 
is no derivative, because there is no tangent line. In order for the notion of the tangent 
line at a point to make sense, the curve must be “smooth” at that point. This means that 
if you imagine a particle traveling at some steady speed along the curve, then the particle 
does not experience an abrupt change of direction. There are two types of situations you 
should be aware of—corners and cusps—where there’s a sudden change of direction and 


hence no derivative. 


EXAMPLE 2.4.4 Discuss the derivative of the absolute value function y = f(x) = |z]. 

If x is positive, then this is the function y = x, whose derivative is the constant 1. 
(Recall that when y = f(x) = ma +), the derivative is the slope m.) If x is negative, then 
we're dealing with the function y = —2x, whose derivative is the constant —1. If x = 0, 
then the function has a corner, i.e., there is no tangent line. A tangent line would have 
to point in the direction of the curve—but there are two directions of the curve that come 
together at the origin. We can summarize this as 


1 it > OS 
y =< a] ife < 0; 


undefined if x =O. 


EXAMPLE 2.4.5 

Discuss the derivative of the function y = «?/°, shown in figure 2.4.1. We will later see 
how to compute this derivative; for now we use the fact that y’ = (2/3)a~'/3. Visually this 
looks much like the absolute value function, but it technically has a cusp, not a corner. The 
absolute value function has no tangent line at 0 because there are (at least) two obvious 
contenders—the tangent line of the left side of the curve and the tangent line of the right 


2/3 does not have a tangent line at 0, but unlike the absolute value 


side. The function y = x 
function it can be said to have a single direction: as we approach 0 from either side the 
tangent line becomes closer and closer to a vertical line; the curve is vertical at 0. But as 
before, if you imagine traveling along the curve, an abrupt change in direction is required 


at 0: a full 180 degree turn. 


In practice we won’t worry much about the distinction between these examples; in both 
cases the function has a “sharp point” where there is no tangent line and no derivative. 
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Figure 2.4.1 A cusp on ot 


Exercises 2.4. 

Find the derivative of y = f(x) = /169—2?. > 
Find the derivative of y = f(t) = 80 —4.9t7. > 
Find the derivative of y = f(a) = a? — (1/x). > 


Find the derivative of y = f(a) = ax? + ba + c (where a, b, and c are constants). > 


Find the derivative of y = f(x) = 2°. > 


Se Pa 


Shown is the graph of a function f(x). Sketch the graph of f’(x) by estimating the derivative 
at a number of points in the interval: estimate the derivative at regular intervals from one 
end of the interval to the other, and also at “special” points, as when the derivative is zero. 
Make sure you indicate any places where the derivative does not exist. 

1.6 


1.4 


12 ri 


1.0 


0.8 


0.6 


0.4 


0.2 


1.0 —0.8 —0.6 —0.4 —0.2 00 02 04 06 08 1.0 


7. Shown is the graph of a function f(x). Sketch the graph of f’(x) by estimating the derivative 
at a number of points in the interval: estimate the derivative at regular intervals from one 
end of the interval to the other, and also at “special” points, as when the derivative is zero. 


2.5 Adjectives For Functions 51 


Make sure you indicate any places where the derivative does not exist. 
4 


0 


0) 1 2 3 4 5 


8. Find the derivative of y = f(x) = 2/V2zr+1=> 
9. Find the derivative of y = g(t) = (2t—1)/((+2) > 


10. Find an equation for the tangent line to the graph of f(x) = 5 — a — 3x” at the point « = 2 
=> 


11. Find a value for a so that the graph of f(x) = x” + ax — 3 has a horizontal tangent line at 
c=A, > 


2.5 ADJECTIVES FOR FUNCTIONS 


As we have defined it in Section 1.3, a function is a very general object. At this point, it 
is useful to introduce a collection of adjectives to describe certain kinds of functions; these 
adjectives name useful properties that functions may have. Consider the graphs of the 
functions in Figure 2.5.1. It would clearly be useful to have words to help us describe the 
distinct features of each of them. We will point out and define a few adjectives (there are 
many more) for the functions pictured here. For the sake of the discussion, we will assume 
that the graphs do not exhibit any unusual behavior off-stage (i.e., outside the view of the 
graphs). 

Functions. Each graph in Figure 2.5.1 certainly represents a function—since each passes 
the vertical line test. In other words, as you sweep a vertical line across the graph of each 
function, the line never intersects the graph more than once. If it did, then the graph 
would not represent a function. 

Bounded. The graph in (c) appears to approach zero as x goes to both positive and 
negative infinity. It also never exceeds the value 1 or drops below the value 0. Because the 
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(a) (b) 
34 eae 
2 4 
i 4 

—3 —2 —1 1 2 3 
-1,4 24 
2 —2 
ube 34 

(c) (d) 
34 3°: 
2 2 

1 2 3 


Figure 2.5.1 Function Types: (a) a discontinuous function, (b) a continuous function, (c) 
a bounded, differentiable function, (d) an unbounded, differentiable function 


graph never increases or decreases without bound, we say that the function represented by 
the graph in (c) is a bounded function. 


DEFINITION 2.5.1 Bounded _ A function f is bounded if there is a number M 
such that |f(a)| < M for every x in the domain of f. 


For the function in (c), one such choice for M would be 10. However, the smallest 
(optimal) choice would be M = 1. In either case, simply finding an M is enough to 
establish boundedness. No such M exists for the hyperbola in (d) and hence we can say 
that it is unbounded. 

Continuity. The graphs shown in (b) and (c) both represent continuous functions. 
Geometrically, this is because there are no jumps in the graphs. That is, if you pick a 
point on the graph and approach it from the left and right, the values of the function 
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approach the value of the function at that point. For example, we can see that this is not 
true for function values near xz = —1 on the graph in (a) which is not continuous at that 
location. 


DEFINITION 2.5.2 Continuous at a Point A function f is continuous at a point 
a if lim f(x) = f(a). 
xLz-a 


DEFINITION 2.5.3 Continuous A function f is continuous if it is continuous at 


every point in its domain. 


Strangely, we can also say that (d) is continuous even though there is a vertical asymp- 
tote. A careful reading of the definition of continuous reveals the phrase “at every point 
in its domain.” Because the location of the asymptote, x = 0, is not in the domain of 
the function, and because the rest of the function is well-behaved, we can say that (d) is 
continuous. 

Differentiability. Now that we have introduced the derivative of a function at a point, 
we can begin to use the adjective differentiable. We can see that the tangent line is well- 
defined at every point on the graph in (c). Therefore, we can say that (c) is a differentiable 


function. 


DEFINITION 2.5.4 Differentiable at a Point A function f is differentiable at 
point a if f’(a) exists. 


DEFINITION 2.5.5 Differentiable A function f is differentiable if is differentiable 
at every point (excluding endpoints and isolated points in the domain of f) in the domain 
of f. 


Take note that, for technical reasons not discussed here, both of these definitions 
exclude endpoints and isolated points in the domain from consideration. 

We now have a collection of adjectives to describe the very rich and complex set of 
objects known as functions. 

We close with a useful theorem about continuous functions: 


THEOREM 2.5.6 Intermediate Value Theorem _ If f is continuous on the interval 
[a,b] and d is between f(a) and f(b), then there is a number c in [a,b] such that f(c) = d. 
= 


This is most frequently used when d = 0. 
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EXAMPLE 2.5.7 Explain why the function f = x? + 3x7 + 2 — 2 has a root between 
0 and 1. 

By theorem 2.3.6, f is continuous. Since f(0) = —2 and f(1) = 3, and 0 is between 
—2 and 3, there is a c € [0,1] such that f(c) = 0. 


This example also points the way to a simple method for approximating roots. 


EXAMPLE 2.5.8 Approximate the root of the previous example to one decimal place. 

If we compute f(0.1), f(0.2), and so on, we find that f (0.6) < 0 and f(0.7) > 0, so by 
the Intermediate Value Theorem, f has a root between 0.6 and 0.7. Repeating the process 
with f (0.61), f(0.62), and so on, we find that f (0.61) < 0 and f(0.62) > 0, so f has a root 
between 0.61 and 0.62, and the root is 0.6 rounded to one decimal place. 


Exercises 2.5. 
1. Along the lines of Figure 2.5.1, for each part below sketch the graph of a function that is: 
a. bounded, but not continuous. 
b. differentiable and unbounded. 
c. continuous at x = 0, not continuous at x = 1, and bounded. 
d. differentiable everywhere except at x = —1, continuous, and unbounded. 
Is f(a) = sin(x) a bounded function? If so, find the smallest M. 
. Is s(t) =1/(1+#") a bounded function? If so, find the smallest M. 
. Is v(u) = 2ln |u| a bounded function? If so, find the smallest M. 


2e—3, ifa<1 
me) = 15 ife>1. 


aR ON 


. Consider the function 


Show that it is continuous at the point x = 0. Is h a continuous function? 
6. Approximate a root of f = x — 4x” + 2x + 2 to one decimal place. > 


7. Approximate a root of f = 2* +2* — 5a +1 to one decimal place. 


3 


Rules for Finding Derivatives 


It is tedious to compute a limit every time we need to know the derivative of a function. 
Fortunately, we can develop a small collection of examples and rules that allow us to 
compute the derivative of almost any function we are likely to encounter. Many functions 
involve quantities raised to a constant power, such as polynomials and more complicated 
combinations like y = (sinx)*. So we start by examining powers of a single variable; this 
gives us a building block for more complicated examples. 


3.1 THE POWER RULE 


We start with the derivative of a power function, f(z) = x”. Here n is a number of 


any kind: integer, rational, positive, negative, even irrational, as in 7”. We have already 
computed some simple examples, so the formula should not be a complete surprise: 


d 


—— ie eee 
dx 


=e 
It is not easy to show this is true for any n. We will do some of the easier cases now, and 
discuss the rest later. 

The easiest, and most common, is the case that n is a positive integer. To compute 
the derivative we need to compute the following limit: 


For a specific, fairly small value of n, we could do this by straightforward algebra. 
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EXAMPLE 3.1.1 Find the derivative of f(x) = 2°. 


4.492 jm Gta = 8 
dx’ + Az—0 Ax ; 
; xv? + 3a2Ax + 32Az? + Az? — x3 
lim . 
Az—30 Az 
: 3x22Ax + 3rAr? + Ax? 
= lm : 
Az 30 Av 
lim 327 + 32Azr + Ax? = 32°. 
Az—-0 


The general case is really not much harder as long as we don’t try to do too much. 
The key is understanding what happens when (x + Ax)” is multiplied out: 


(2+ Ax)” = ao" + na” Ag + age” 7Ar? +--+ + +ay_1r2Ar" 1! + Ac”. 


We know that multiplying out will give a large number of terms all of the form x’ Az’, and 
in fact that 7+ j7 = n in every term. One way to see this is to understand that one method 
for multiplying out (« + Az)” is the following: In every (2 + Az) factor, pick either the x 
or the Az, then multiply the n choices together; do this in all possible ways. For example, 
for (x + Ax), there are eight possible ways to do this: 


(a + Ax)(a + Ax)(a + Av) = cax+ cxAxr 4+ cAre + xcArAr 
+ Arar + Arx Ax + ArAgz + AxArAxz 
=? +a7Axc+a7An + cAx? 
+ 27Agx + cAx? + cAx? + Ac? 
= g° + 3x7Ax + 32Az? + Ax? 
No matter what n is, there are n ways to pick Az in one factor and x in the remaining 
n—1 factors; this means one term is nz” !Ax. The other coefficients are somewhat harder 
to understand, but we don’t really need them, so in the formula above they have simply 


been called a2, a3, and so on. We know that every one of these terms contains Az to at 
least the power 2. Now let’s look at the limit: 


— . («+Anz)" — 2” 
= lim 


ae ea 
de KEAG Ax 
3. One Ae aan” 7 Aer eto, pet Agta 
= lim 
Az—0 Ax 
. Kae Ae age’ Rat as bag eee Ae” 
= lim 
Az—>0 Ax 


im na sg A Ag Vase Ag oat aa, 
x 
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Now without much trouble we can verify the formula for negative integers. First let’s 
look at an example: 


EXAMPLE 3.1.2 Find the derivative of y = x~*. Using the formula, y! = —3a7°~' = 
—37*. 


Here is the general computation. Suppose n is a negative integer; the algebra is easier 


to follow if we use n = —m in the computation, where m is a positive integer. 
d d Ag) =n" 
Se Se ln ree) : 
da dx Ar>0 Ax 
1 1 
=. i (a+Ax)™ xm 
Az—->0 Ax 


_ a” — (e+ Ar)™ 
lim 
Az0 (4 + Ar)™a™Az 


x™ — (2™ + ma™-1Ag + agr™ 7 Ag? + +++ + am—12Arz™ 1! + Az™) 


| 
E 


Ar—0 (e+ Ag)™a™A 
i —ma™—1 — agx™-* Ag — ++» — am_12Aa™-? — Az™1) 
= lim 
Az0 (e+ Ag)™a™ 
—1 m—-1 
—max™ —M2x ae ee 7 
= —__ = = = —my™ 172m = ng ™1 = ng” 1 
Le x 


We will later see why the other cases of the power rule work, but from now on we will 
use the power rule whenever n is any real number. Let’s note here a simple case in which 
the power rule applies, or almost applies, but is not really needed. Suppose that f(x) = 1; 
remember that this “1” is a function, not “merely” a number, and that f(x) = 1 has a 
graph that is a horizontal line, with slope zero everywhere. So we know that f’(x) = 0. 
We might also write f(x) = «°, though there is some question about just what this means 
at_ 2 = 0. If we apply the power rule, we get f’(x) = Or~' = 0/x = 0, again noting that 
there is a problem at x = 0. So the power rule “works” in this case, but it’s really best to 
just remember that the derivative of any constant function is zero. 


Exercises 3.1. 


Find the derivatives of the given functions. 


1. x — a. 2 => 
1 

3.0 = SS 4.2" > 
x 
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3.2 LINEARITY OF THE DERIVATIVE 


An operation is linear if it behaves “nicely” with respect to multiplication by a constant 
and addition. The name comes from the equation of a line through the origin, f(x) = mz, 
and the following two properties of this equation. First, f(cr) = m(cx) = c(mx) = cf(z), 
so the constant c can be “moved outside” or “moved through” the function f. Second, 
f(aty) =m(a+y) =maxt+my = f(x) + f(y), so the addition symbol likewise can be 
moved through the function. 

The corresponding properties for the derivative are: 


,_ a _ ad ot Sie) 
(cf (a))! = efx) = = Fle) = ef"), 
and F i F 
(f(x) + g(x))’ = ag fe) + o(2)) = GF Flz) + J g(a) = f(a) + 9'(2). 


It is easy to see, or at least to believe, that these are true by thinking of the dis- 
tance/speed interpretation of derivatives. If one object is at position f(t) at time t, we 
know its speed is given by f’(t). Suppose another object is at position 5f(t) at time t, 
namely, that it is always 5 times as far along the route as the first object. Then it “must” 
be going 5 times as fast at all times. 

The second rule is somewhat more complicated, but here is one way to picture it. 
Suppose a flat bed railroad car is at position f(t) at time t, so the car is traveling at a 
speed of f’(t) (to be specific, let’s say that f(t) gives the position on the track of the rear 
end of the car). Suppose that an ant is crawling from the back of the car to the front so 
that its position on the car is g(t) and its speed relative to the car is g’(t). Then in reality, 
at time t, the ant is at position f(t) + g(t) along the track, and its speed is “obviously” 
f(t) + 9'(t). 

We don’t want to rely on some more-or-less obvious physical interpretation to deter- 
mine what is true mathematically, so let’s see how to verify these rules by computation. 


3.2 Linearity of the Derivative 


We’ll do one and leave the other for the exercises. 


4 (() 4 g(a) = tim Se +AD) +(e + Az) ~ (Fe) +9) 


dx Az—0 AT 

im {let Ae) + gle + Ax) = fle) = lw) 

Az—0 Ax 
— am Leet Ax) = flo) + gle + Ae) — lw) 

Az—0 Ax 

, f(e@+Az)— f(z) . g(e+Az) — g(x) 

- A ( Ag - Ag ) 
_. fle+Az)-flx), . gle+Az) —g(z) 
i est At . roa Ax 
= f'(x) +9'(2) 


This is sometimes called the sum rule for derivatives. 
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EXAMPLE 3.2.1 Find the derivative of f(2) = x° + 5x”. We have to invoke linearity 


twice here: 


f(x) = Lae + 527) = Hoe + # (542) = 524+ 5 (an) =5e*4 5-22) = 5a* + 10z. 


into a single step. The following example shows an acceptably detailed computation. 


dx dx dx dx 


EXAMPLE 3.2.2 Find the derivative of f(x) = 3/a* — 2x? + 6x — 7. 


d 
f'(a) us (= — 2x? + 6x — 7) = 7 3e — 227 + 62 — 7) =—12a° — 4 +6. 


dx \ x4 


Exercises 3.2. 


Find the derivatives of the functions in 1-6. 


1. 


TO AR wD 


Ba® + 1227-15 > 

—4e° + 327 — 5/2? > 

5(—30" + 5¢+1) > 

f(x) +(x), where f(x) = x” — 3x 42 and g(x) = 22° — 52 > 

(x +1)(a{* + 22-3) > 

/625 — x? + 34° + 12 (See section 2.1.) > 

Find an equation for the tangent line to f(x) = 2° /4—1/a at «= —-2. > 


Because it is so easy with a little practice, we can usually combine all uses of linearity 
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8. Find an equation for the tangent line to f(a) = 3a? — 2° at r=4. > 


9. Suppose the position of an object at time ¢ is given by f(t) = —49#?/10+5t+10. Find a 
function giving the speed of the object at time t. The acceleration of an object is the rate at 
which its speed is changing, which means it is given by the derivative of the speed function. 
Find the acceleration of the object at time t. > 


10. Let f(x) = 2° and c= 3. Sketch the graphs of f, cf, f’, and (cf)’ on the same diagram. 
11. The general polynomial P of degree n in the variable x has the form P(x) = So axa" = 
k=0 
ao +aix+...+an2". What is the derivative (with respect to x) of P? > 


12. Find a cubic polynomial whose graph has horizontal tangents at (—2,5) and (2,3). > 


13. Prove that + (cf(a)) = cf'(x) using the definition of the derivative. 
Bs 


14. Suppose that f and g are differentiable at x. Show that f — g is differentiable at x using the 
two linearity properties from this section. 


3.3 THE PrRopDucT RULE 


Consider the product of two simple functions, say f(x) = (x7 + 1)(a? — 3x). An obvious 
guess for the derivative of f is the product of the derivatives of the constituent functions: 
(2x) (327-3) = 6x? —6z. Is this correct? We can easily check, by rewriting f and doing the 
calculation in a way that is known to work. First, f(a) = x°—32°+a°—3a = 2° —22°—32, 
and then f’(«) = 5a*—6x? —3. Not even close! What went “wrong”? Well, nothing really, 
except the guess was wrong. 

So the derivative of f(x)g(x) is NOT as simple as f’(x)g’(x). Surely there is some 
rule for such a situation? There is, and it is instructive to “discover” it by trying to do 
the general calculation even without knowing the answer in advance. 


d — 4. f(c+ Ax)g(a + Ax) — f(x)g(x) 
ag of (#)9(2)) = Jim, ee 
— ym Let Acigle + Ax) — f(a + Az)g(z) + fe + Ax) ge) — f@)g9(@) 
Az 0 Az 
— ym Let Ariale + Az) — fle + Az)g(z) Ae f(a +Az)g(x) — f(x)g(z) 
~~ Ar—>0 Ax Az—0 Ag 
= lim, fle + An) SSIES + in, te 


| 
—. 
= 
a 
S 
+ 
a 
oS 
= 
os 


A couple of items here need discussion. First, we used a standard trick, “add and subtract 
the same thing”, to transform what we had into a more useful form. After some rewriting, 
we realize that we have two limits that produce f’(x) and g/(x). Of course, f’(x) and 
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g'(x) must actually exist for this to make sense. We also replaced Jim, f(a + Az) with 
wt 
f (x)—why is this justified? 
What we really need to know here is that Jim | f(a+Az) = f(x), or in the language 
xr~— 


of section 2.5, that f is continuous at 2. We already know that f’(x) exists (or the whole 
approach, writing the derivative of fg in terms of f’ and g’, doesn’t make sense). This 
turns out to imply that f is continuous as well. Here’s why: 


Aim fle + Aa) = lim (f(a + Az) — f(z) + F(x) 


= lim 
= f'(xz)-0+ f(x) = f(a) 


To summarize: the product rule says that 
=. (F(2)9(@)) = f(x)g'(x) + f'(z)g(z). 


Returning to the example we started with, let f(a) = (x? +1)(x? — 3x). Then f’(x) = 
(a? + 1)(3a? — 3) + (22)(a? — 32) = 3z* — 3a? + 32” —3 420° — 6a? = 5a” — 62” — 3, 
as before. In this case it is probably simpler to multiply f(x) out first, then compute the 
derivative; here’s an example for which we really need the product rule. 


EXAMPLE 3.3.1 Compute the derivative of f(x) = 27.625 — x?. We have already 


d —2£ 
computed — 1/625 — «2 = ———————. Now 
, dx V625 — x? 


—£ —ax? + 22(625— 27) —8x> + 12502 
————. 27/625 — 2? = = : 
V625 — x? V625 — x? V625 — x? 


fi(e) = 2° 


Exercises 3.3. 


In 1-4, find the derivatives of the functions using the product rule. 
1. a°(x*® —5x4+10) > 

. (a? + 5x — 3)(a° — 62° + 32? — 74 +1) > 

. JeV/625 — x? > 

625 — x2 = 


720 


wo bv 


A, 
5. Use the product rule to compute the derivative of f(#) = (2x — 3)°. Sketch the function. 


Find an equation of the tangent line to the curve at « = 2. Sketch the tangent line at x = 2. 
=> 


62 


6. 


7. 
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Suppose that f, g, and h are differentiable functions. Show that (fgh)' (x) = f'(x)g(x)h(x)+ 
f(x)g'(w)h(a) + f(@)g(@)h'(2). 


State and prove a rule to compute (fghi)'(x), similar to the rule in the previous problem. 


Product notation. Suppose fi, fe,... fn are functions. The product of all these functions 


can be written 
[| &- 
k=1 


This is similar to the use of NE to denote a sum. For example, 


5 
I] fe = fihefsfafs 
k=1 


and 7 
livelier 
k=1 


We sometimes use somewhat more complicated conditions; for example 
n 
I] fs 
k=1,k7j 


denotes the product of f1 through f, except for f;. For example, 


Il ko 2 3 5 41 
CS bide ae ea oS 
k=1,k#4 
. The generalized product rule says that if fi, fo,..., fn are differentiable functions at x 
then 
d n n n 
all fe@m => fix) [I fee) 
k=1 j=l k=1,k4j 


Verify that this is the same as your answer to the previous problem when n = 4, and write 
out what this says when n = 5. 


3.4 THE QUOTIENT RULE 


What is the derivative of (x? + 1)/(a? — 3x)? More generally, we’d like to have a formula 
to compute the derivative of f(x)/g(ax) if we already know f’(x) and g’(x). Instead of 
attacking this problem head-on, let’s notice that we’ve already done part of the problem: 
f(x)/g(x) = f(x) -(1/g(x)), that is, this is “really” a product, and we can compute the 
derivative if we know f’(x) and (1/g(x))’. So really the only new bit of information we 
need is (1/g(x))/ in terms of g'(x). As with the product rule, let’s set this up and see how 
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far we can get: 


1 AB) — a6) 
se ff g(a+Aaz g(x 
dx g(x) Az+0 Az 
ree 
‘ g(@+Axz)g(x 
pee Ax 
say $2) — 9a + Az) 
Az0 g(a + Ax)g(x)Ax 


= i _ g(@ + Ax) — g(x) il 
Az—0 Az g(a + Ax)g(ax) 
__ 9g (2) 
g(x)? 


Now we can put this together with the product rule: 


ad f(x) _ f(a) ~9' (2) ies —flx)g' (a) + F(a)g(e) _ fle) g(a) — Fle) g'(a) 
dx g(x) g(a)? 


g(x) g(a)? g(a)? 


EXAMPLE 3.4.1 Compute the derivative of (x? + 1)/(x? — 32). 


d 2?+1 — 2e(a? — 3x) — (2? +1)(32?-3)  -a2*-62?+3 
dx x? —32 (a? — 32)? (43 — 3x)? 


It is often possible to calculate derivatives in more than one way, as we have already 
seen. Since every quotient can be written as a product, it is always possible to use the 
product rule to compute the derivative, though it is not always simpler. 


EXAMPLE 3.4.2. Find the derivative of \/625 — x?/,/x in two ways: using the quotient 
rule, and using the product rule. 
Quotient rule: 
d J625—a? — Ja(—2/V625 — x?) — /625 — x? -1/(2/z) 
dx f/x e 


Note that we have used \/x = x'/? to compute the derivative of \/z by the power rule. 


Product rule: 


d —1 —2£ 
— 4/625 — 22¢ 71/2 = 4/625 — 22 — 73/2 4. eg 
dx 2 V625 — x? 


With a bit of algebra, both of these simplify to 


x? + 625 
2/625 — x2x73/2 
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Occasionally you will need to compute the derivative of a quotient with a constant 
numerator, like 10/”?. Of course you can use the quotient rule, but it is usually not the 
easiest method. If we do use it here, we get 


d 10 «?-0-10-2¢ —20 
dx x2 x4 3 


since the derivative of 10 is 0. But it is simpler to do this: 


d 10 d 

—— = Nr FS 208. 

dez* dz 
Admittedly, x? is a particularly simple denominator, but we will see that a similar calcu- 
lation is usually possible. Another approach is to remember that 


dl _ ~g'(x) 
dx g(x) g(x)? 


but this requires extra memorization. Using this formula, 


d 10 | —20 
dx x? et 


Note that we first use linearity of the derivative to pull the 10 out in front. 


Exercises 3.4. 


Find the derivatives of the functions in 1—4 using the quotient rule. 


1 x? ve 2 ae? +5r—3 ~ 
* 23 —5a+10 * 95 — 623 + 322 —7r+4+1 
vz, 4 Vee a 
625 — x2 ; >? 


5. Find an equation for the tangent line to f(x) = (x? — 4)/(5— 2x) atr=3. => 
6. Find an equation for the tangent line to f(a) = (a — 2)/(a@®° + 4a —1) atw=1. > 


7. Let P be a polynomial of degree n and let Q be a polynomial of degree m (with Q not the 
zero polynomial). Using sigma notation we can write 


P=). ae", => tc”. 
k=0 k=0 
Use sigma notation to write the derivative of the rational function P/Q. 


8. The curve y = 1/(1+ x”) is an example of a class of curves each of which is called a witch 
of Agnesi. Sketch the curve and find the tangent line to the curve at x = 5. (The word 
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witch here is a mistranslation of the original Italian, as described at 


http: //mathworld.wolfram.com/WitchofAgnesi.html 


ai http://instructional1.calstatela.edu/sgray/Agnesi/ 
WitchHistory/Historynamewitch. html.) 
=> 
9. If f’(4) =5, g/(4) = 12, (fg)(4) = f(4)9(4) = 2, and g(4) = 6, compute f(4) and = at 4. 
=> 


3.5 THE CHAIN RULE 


So far we have seen how to compute the derivative of a function built up from other 
functions by addition, subtraction, multiplication and division. There is another very 
important way that we combine simple functions to make more complicated functions: 
function composition, as discussed in section 2.3. For example, consider \/625 — x?. This 
function has many simpler components, like 625 and x”, and then there is that square root 
symbol, so the square root function xz = z'/? is involved. The obvious question is: can 
we compute the derivative using the derivatives of the constituents 625 — x” and //z? We 
can indeed. In general, if f(a) and g(x) are functions, we can compute the derivatives of 
f(g(x)) and g(f(x)) in terms of f’(x) and g’(z). 


EXAMPLE 3.5.1 Form the two possible compositions of f(z) = Vz and g(x) = 
625 — x” and compute the derivatives. First, f(g(x)) = \/625 — «?, and the derivative 


is —a/./625 — x2 as we have seen. Second, g(f(x)) = 625 — (\/x)? = 625 — x with deriva- 
tive —1. Of course, these calculations do not use anything new, and in particular the 


derivative of f(g(x)) was somewhat tedious to compute from the definition. 


Suppose we want the derivative of f(g(xz)). Again, let’s set up the derivative and play 
some algebraic tricks: 


f(g(2)) = im 


Az—0 Ax 
— tm Lge +A2)) — Flg(@)) gle + Ax) — g(a) 
Az30 = g(a + Ax) — g(x) At 


Now we see immediately that the second fraction turns into g’(x) when we take the limit. 
The first fraction is more complicated, but it too looks something like a derivative. The 
denominator, g(x + Ax) — g(x), is a change in the value of g, so let’s abbreviate it as 
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Ag = g(x + Ax) — g(x), which also means g(x + Ax) = g(x) + Ag. This gives us 


As Ax goes to 0, it is also true that Ag goes to 0, because g(x + Ax) goes to g(x). So we 


can rewrite this limit as 


Now this looks exactly like a derivative, namely f’(g(x)), that is, the function f’(a) with 
x replaced by g(x). If this all withstands scrutiny, we then get 


 f(a(2)) = £(a(a))9(2). 


Unfortunately, there is a small flaw in the argument. Recall that what we mean by lima,_.0 
involves what happens when Az is close to 0 but not equal to 0. The qualification is very 
important, since we must be able to divide by Ax. But when Az is close to 0 but not equal 
to 0, Ag = g(a + Ax)) — g(x) is close to 0 and possibly equal to 0. This means it doesn’t 
really make sense to divide by Ag. Fortunately, it is possible to recast the argument to 
avoid this difficulty, but it is a bit tricky; we will not include the details, which can be 
found in many calculus books. Note that many functions g do have the property that 
g(a + Ax) — g(x) # 0 when Az is small, and for these functions the argument above is 
fine. 

The chain rule has a particularly simple expression if we use the Leibniz notation for 
the derivative. The quantity f’(g(x)) is the derivative of f with x replaced by g; this can 
be written df/dg. As usual, g/(a) = dg/dx. Then the chain rule becomes 


a a a ag 
dx dgdx’ 


This looks like trivial arithmetic, but it is not: dg/dzx is not a fraction, that is, not literal 
division, but a single symbol that means g’(x). Nevertheless, it turns out that what looks 
like trivial arithmetic, and is therefore easy to remember, is really true. 

It will take a bit of practice to make the use of the chain rule come naturally—it is 
more complicated than the earlier differentiation rules we have seen. 


EXAMPLE 3.5.2 Compute the derivative of (625 —2x?. We already know that the 
answer is —2/.\/625 — x?, computed directly from the limit. In the context of the chain 
rule, we have f(x) = Vz, g(x) = 625 — 27. We know that f’(x) = (1/2)a7/?, so 
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f'(g(x)) = (1/2)(625 — x?)~/?. Note that this is a two step computation: first compute 
f'(x), then replace x by g(x). Since g’(x) = —2x we have 


1 —2x 


f'(g(2))g'(x) = ipa = Vena 


EXAMPLE 3.5.3 Compute the derivative of 1/./625 — x?. This is a quotient with a 
constant numerator, so we could use the quotient rule, but it is simpler to use the chain 
rule. The function is (625 —«”)~'/?, the composition of f (2) = «7'/? and g(x) = 625-2”. 
We compute f’(x) = (—1/2)~°/? using the power rule, and then 


—1 | p= x 
2(625 — @2)8/2 “"? ~ (625 — 22)3/2" 


f'(g(a))9'(x) = 


In practice, of course, you will need to use more than one of the rules we have developed 
to compute the derivative of a complicated function. 


EXAMPLE 3.5.4 Compute the derivative of 


g? +1 


LD) ae 


The “last” operation here is division, so to get started we need to use the quotient rule 
first. This gives 


(2? — 1)'aVa2? +1 — (a? —1)(xVx? +1)’ 


fe a2(x2 + 1) 
Qa?7V x? +1 — (a? — 1)(aV ax? + 1)’ 
> x?(x? + 1) , 


Now we need to compute the derivative of r\/x2+1. This is a product, so we use the 
product rule: 


faV@ 41 =25 Veit Ve tT 


Finally, we use the chain rule: 
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And putting it all together: 


Qa? 4/a2 +1 = (x? — 1) (ava? + 1)! 


2274/¢2-+ 1 — (x? —1) cS + Var? + i) 
- x?(a? + 1) 


This can be simplified of course, but we have done all the calculus, so that only algebra is 
left. 


EXAMPLE 3.5.5 Compute the derivative of 4/1 + \/1+ x. Here we have a more 


complicated chain of compositions, so we use the chain rule twice. At the outermost “layer” 


we have the function g(x) = 1+ 1/1+ Vz plugged into f(x) = Vz, so applying the chain 


rule once gives 


d 1 ae: 
—1+Vl4+Ve==(14+yV1live —{1l+yVl+vz}. 
dx 2 dx 

Now we need the derivative of \/1+ Vz. Using the chain rule again: 


d =| =. eee 
aVitve=5 (1+ v2) 3% ae 


So the original derivative is 


d | 1 ee ae jel. 
ae 14 i+ ve=5 (14 yi+ve) 5 (l+v2) "52 Me, 


i 
8J/rV1t+ JevV14+V714+ Ve 


Using the chain rule, the power rule, and the product rule, it is possible to avoid using 
the quotient rule entirely. 
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3 
EXAMPLE 3.5.6 Compute the derivative of f(x) = 


then 


oT Write f(x) = 23(2741)"*, 


d 
f'(z) = x (x 41) -*+327(27 +1)" 
Ha 
=e) (=a 41) (2a) 32" 41) 
= —2Qe*(4? + 1)~* + 3x?(2? +1)" 
—2a4 3x7 
= + 
(v2 41)? 2241 
_ —2a4 - 3x7 (x? + 1) 
a (x2 +1)? (x? +1)2 
_ —2a4+ 304432? 2t+ 32? 


(x? +1)? i (x? +1)? 


Note that we already had the derivative on the second line; all the rest is simplification. It 
is easier to get to this answer by using the quotient rule, so there’s a trade off: more work 


for fewer memorized formulas. 


Exercises 3.5. 


Find the derivatives of the functions. For extra practice, and to check your answers, do some of 
these in more than one way if possible. 


1. ot — 30° +(1/2)2? + 72-2 > 2. 2-2 4+4/es 
3. (2?+1) => A. 2/169 — «x? > 
5B. (a? —49 +5)V25-22 > 6. Vr?—<2?,r is a constant > 
i 
7. l+at= 8 : 
5 -— /x 
2 
+e 1) 
9. (1+32)? > 10. & 
(1+ 32) = 
— 72 
ia, 22 ce, a 
x x 
13. \/23 — 2? — (1/2) > 14. 100/(100 — x”)?/? = 
15. VYet+23 => 16. Ve? +1)? + 1+ (#241)? > 
17. («+38 > 18. (4-2)? > 
19. (#7 +5)? => 20. (6-227)? s 
21. (1—42°) 7 = 22. 5(a@+1-—1/2) > 
_ i 
23. 4(22?-2+3)?=> a 
Gri) 1+1/z 
= é 
re 26. 1)(5 — 2x) /2 
5 ee an a 6. (a +1)(5-22x)/2 > 
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27. 


29. 


31. 


33. 


35. 


36. 
37. 


38. 


39. 


40. 
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(327 + 1)(2¢ — 4)? = 28. 
ai 20 
= = => 32. 
(2a + NG — 3) i om 
(20 +1)?(27 +1) => 


Find an equation for the tangent line to f(x) = (x — 2)'/?/(a? + 4a — 1)? atw=1. > 


xr+1 

x—-1l 

(x D(e-2) 
xr—3 


3(x* + 1)(2x” — 1)(2e +3) > 


((Qe4+1)-*+3)* => 


Find an equation for the tangent line to y= 9x” at ol). 
Find an equation for the tangent line to (x? — 4x + 5)\/25 — x? at (3,8). > 


Find an equation for the tangent line to 


(2? +241) 


(i—2) 


at (2,—7). => 


Find an equation for the tangent line to Ve +1)? + 1+ (2? +1)? at (1,1/44+ V5). = 


A 


Transcendental Functions 


So far we have used only algebraic functions as examples when finding derivatives, that is, 
functions that can be built up by the usual algebraic operations of addition, subtraction, 
multiplication, division, and raising to constant powers. Both in theory and practice there 
are other functions, called transcendental, that are very useful. Most important among 
these are the trigonometric functions, the inverse trigonometric functions, exponential 


functions, and logarithms. 


4.1 TRIGONOMETRIC FUNCTIONS 


When you first encountered the trigonometric functions it was probably in the context of 
“triangle trigonometry,” defining, for example, the sine of an angle as the “side opposite 
over the hypotenuse.” While this will still be useful in an informal way, we need to use a 
more expansive definition of the trigonometric functions. First an important note: while 
degree measure of angles is sometimes convenient because it is so familiar, it turns out to 
be ill-suited to mathematical calculation, so (almost) everything we do will be in terms of 


radian measure of angles. 
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To define the radian measurement system, we consider the unit circle in the xy-plane: 


(cos x, sin x) 


(1, 0) 


An angle, x, at the center of the circle is associated with an arc of the circle which is said 
to subtend the angle. In the figure, this arc is the portion of the circle from point (1,0) 
to point A. The length of this arc is the radian measure of the angle x; the fact that the 
radian measure is an actual geometric length is largely responsible for the usefulness of 
radian measure. The circumference of the unit circle is 2mr = 27(1) = 27, so the radian 
measure of the full circular angle (that is, of the 360 degree angle) is 27. 

While an angle with a particular measure can appear anywhere around the circle, we 
need a fixed, conventional location so that we can use the coordinate system to define 
properties of the angle. The standard convention is to place the starting radius for the 
angle on the positive x-axis, and to measure positive angles counterclockwise around the 
circle. In the figure, x is the standard location of the angle 7/6, that is, the length of the 
arc from (1,0) to A is 7/6. The angle y in the picture is —7/6, because the distance from 
(1,0) to B along the circle is also 7/6, but in a clockwise direction. 

Now the fundamental trigonometric definitions are: the cosine of x and the sine of x 
are the first and second coordinates of the point A, as indicated in the figure. The angle x 
shown can be viewed as an angle of a right triangle, meaning the usual triangle definitions 
of the sine and cosine also make sense. Since the hypotenuse of the triangle is 1, the “side 
opposite over hypotenuse” definition of the sine is the second coordinate of point A over 
1, which is just the second coordinate; in other words, both methods give the same value 
for the sine. 

The simple triangle definitions work only for angles that can “fit” in a right triangle, 
namely, angles between 0 and 7/2. The coordinate definitions, on the other hand, apply 
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to any angles, as indicated in this figure: 


(cos x, sin x) 


The angle x is subtended by the heavy arc in the figure, that is, x = 77/6. Both 
coordinates of point A in this figure are negative, so the sine and cosine of 77/6 are both 


negative. 
The remaining trigonometric functions can be most easily defined in terms of the sine 


and cosine, as usual: 


sin x 
tan xz = 
COS x 
COS x 
cotxz = — 
sin x 
1 
sec xr = 
COS x 
1 
csc x = — 
sin x 


and they can also be defined as the corresponding ratios of coordinates. 

Although the trigonometric functions are defined in terms of the unit circle, the unit 
circle diagram is not what we normally consider the graph of a trigonometric function. 
(The unit circle is the graph of, well, the circle.) We can easily get a qualitatively correct 
idea of the graphs of the trigonometric functions from the unit circle diagram. Consider 
the sine function, y = sinxz. As x increases from 0 in the unit circle diagram, the second 
coordinate of the point A goes from 0 to a maximum of 1, then back to 0, then to a 
minimum of —1, then back to 0, and then it obviously repeats itself. So the graph of 
y = sinz must look something like this: 


—20 —3n/2 —1T —1/2 n/2 T 30/2 20 
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the point A goes from 1 to 0 then to —1, back to 0 and back to 1, so the graph of y = cos x 
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Similarly, as angle x increases from 0 in the unit circle diagram, the first coordinate of 


must look something like this: 


Exercises 4.1. 


Some useful trigonometric identities are in appendix B. 


i. 


. Find all values of @ such that cos(20) = 1/2; give your answer in radians. > 


Se oe 
- © 


CANO aA WwW bd 


Find all values of @ such that sin(@) = —1; give your answer in radians. > 


. Use an angle sum identity to compute cos(7/12). > 


Use an angle sum identity to compute tan(57/12). > 


. Verify the identity cos(t)/(1 — sin(t)) = 1+ sin(t). 

. Verify the identity 2 csc(20) = sec(@) csc(6). 

. Verify the identity sin(30) — sin(@) = 2cos(20) sin(6). 

. Sketch y = 2sin(z). 

. Sketch y = sin(3z). 

. Sketch y = sin(—2). 

. Find all of the solutions of 2sin(t) — 1 — sin?(t) = 0 in the interval [0,2]. > 


4.2 THE DERIVATIVE OF sinz 


What about the derivative of the sine function? The rules for derivatives that we have are 
no help, since sin x is not an algebraic function. We need to return to the definition of the 


derivative, set up a limit, and try to compute it. Here’s the definition: 


Using some trigonometric identities, 


sin(x + Ax) — sing 


—sinzx= hm 
dx Az—>0 Ax 


sin(a+Azx)—sing  sinzcosAz +sinAxcosz — sing 


Az Ax 
cos Ax — 1 sin Ax 
Nt 


COs & 
Ax Ax 


=sinz 


we can make a little progress on the quotient: 
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This isolates the difficult bits in the two limits 


cos Ax — 1 ; sin Ax 
ee im 
Az—-0 Ax Ax—->0 Ax 


Here we get a little lucky: it turns out that once we know the second limit the first is quite 
easy. The second is quite tricky, however. Indeed, it is the hardest limit we will actually 
compute, and we devote a section to it. 


4.3 A HARD LIMIT 


We want to compute this limit: 
sin Ax 


1m 
Az30 Ax 


Equivalently, to make the notation a bit simpler, we can compute 


In the original context we need to keep x and Az separate, but here it doesn’t hurt to 
rename Az to something more convenient. 

To do this we need to be quite clever, and to employ some indirect reasoning. The 
indirect reasoning is embodied in a theorem, frequently called the squeeze theorem. 


THEOREM 4.3.1 Squeeze Theorem Suppose that g(x) < f(x) < A(z) for all x 
close to a but not equal to a. If lim,_,, g(x) = L = lim,_,, h(a), then lim, _,, f(x) = L. 
= 


This theorem can be proved using the official definition of limit. We won’t prove it 
here, but point out that it is easy to understand and believe graphically. The condition 
says that f(x) is trapped between g(x) below and h(x) above, and that at x = a, both g 
and h approach the same value. This means the situation looks something like figure 4.3.1. 


? sin(/a), the upper and lower curves are x? and —x?. Since the 


The wiggly curve is x 
sine function is always between —1 and 1, —a? < x? sin(r/x) < x?, and it is easy to see 
that lim,_,9 —z? = 0 = lim,_,9 x”. It is not so easy to see directly, that is algebraically, 
that lim, .9 x? sin(7/ax) = 0, because the 7/x prevents us from simply plugging in x = 0. 
The squeeze theorem makes this “hard limit” as easy as the trivial limits involving x. 
To do the hard limit that we want, lim,_,9(sin x) /x, we will find two simpler functions 
g and h so that g(x) < (sinzx)/x < h(x), and so that lim, .9 g(x) = lim, 49 h(x). Not too 
surprisingly, this will require some trigonometry and geometry. Referring to figure 4.3.2, 
x is the measure of the angle in radians. Since the circle has radius 1, the coordinates of 
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Figure 4.3.1 The squeeze theorem. 


point A are (cos z,sin x), and the area of the small triangle is (cos x sin) /2. This triangle 
is completely contained within the circular wedge-shaped region bordered by two lines and 
the circle from (1,0) to point A. Comparing the areas of the triangle and the wedge we 
see (cosxzsinaz)/2 < x/2, since the area of a circular region with angle 0 and radius r is 
Or? /2. With a little algebra this turns into (sin)/ax < 1/cos2, giving us the h we seek. 


1s 


1 


Figure 4.3.2 Visualizing sin v/a. 


To find g, we note that the circular wedge is completely contained inside the larger 
triangle. The height of the triangle, from (1,0) to point B, is tanz, so comparing areas we 
get x/2 < (tanz)/2 = sinz/(2cosx). With a little algebra this becomes cos x < (sinz)/z. 


So now we have 
sin x 1 
< 


cosx < < . 
x COS x 
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Finally, the two limits lim, _,)9 cosx and lim,_,9 1/ cosa are easy, because cos(0) = 1. By 
the squeeze theorem, lim,_,9(sin x) /x = 1 as well. 

Before we can complete the calculation of the derivative of the sine, we need one other 
limit: 
cos xz — 1 


im 
x0 gL 


This limit is just as hard as sinx/z, but closely related to it, so that we don’t have to do 
a similar calculation; instead we can do a bit of tricky algebra. 


2 


cos x — l cosx—lcosx+1l cos* x — 1 —sin? x sinx sinzx 


i zg cosx+1l <a(cosr+1) 2(cosx+1) x cosxr+1 


To compute the desired limit it is sufficient to compute the limits of the two final fractions, 
as x goes to 0. The first of these is the hard limit we’ve just done, namely 1. The second 
turns out to be simple, because the denominator presents no problem: 


sin & sin 0 0 
im = = — =, 
z>0cosx+1 cosO+1 2 
Thus, 
cos xz — 1 
lim = 0 
xz—0 x 
Exercises 4.3. 
1. Compute lim sinibe) => 2. Compute lim sae) 
z30 x0 sin(22) 


an & 


t (4 
3. Compute lim co) 


t 
4. Compute lim 
2—0 csc(3x) P x0 


x 


5. Compute lim a 
a>n/4 cos(2z) 


6. For all x > 0, 42 —9 < f(x) < 2? — 4c +7. Find lim f(z). => 
xz 
7. For all x, 2x < g(x) < at — 2? +2. Find lim ga). => 
2 


8. Use the Squeeze Theorem to show that lim, x cos(2/x) = 0. 
2 
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4.4 THE DERIVATIVE OF sinz, CONTINUED 


Now we can complete the calculation of the derivative of the sine: 


sin(z + Ax) —sinz 


—sinz = lm 
dx Az—0 Ax 
i . cosAxr—1 ae sin Ax 
= lim sinx——— + cosz 
Az—-0 Ax Ax 


=sinx:-0+cosx:-1=cosz. 


The derivative of a function measures the slope or steepness of the function; if we 
examine the graphs of the sine and cosine side by side, it should be that the latter appears 
to accurately describe the slope of the former, and indeed this is true: 


1+ 1 


n/2 T 30/2 20 n/2 T 30/2 20 


sin x cos x 


Notice that where the cosine is zero the sine does appear to have a horizontal tangent 
line, and that the sine appears to be steepest where the cosine takes on its extreme values 
of 1 and —1. 

Of course, now that we know the derivative of the sine, we can compute derivatives of 


more complicated functions involving the sine. 


EXAMPLE 4.4.1 Compute the derivative of sin(x?). 


— sin(x*) = cos(x”) - 2a = 2x cos(x”). 


dx 


EXAMPLE 4.4.2 Compute the derivative of sin?(x? — 5a). 


us sin?(x* — 5a) = *(sin(a" —52x))? 


dx 
= 2(sin(z* — 5zx))1 cos(x? — 5x)(3x? —5) 


= 2(3n" — 5) cos(x* — 5a) sin(x? — 52). 
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Exercises 4.4. 


Find the derivatives of the following functions. 


1. sin?(./z) > 2. /xsing > 
Ss a x +2 
sin x sin x 


5. V1l—sin?x => 


4.5 DERIVATIVES OF THE TRIGONOMETRIC FUNCTIONS 


All of the other trigonometric functions can be expressed in terms of the sine, and so their 
derivatives can easily be calculated using the rules we already have. For the cosine we 
need to use two identities, 


T 
cos x = sin(a + 5 


T 
sinxz = —cos(x+ —). 
2 
Now 
‘ T T ‘ 
—cosxz = — sin(x + —) = cos(x + —)-1=-—sing 
dx dx 2 
d sing cos?#+sin? x 1 5 
— tanz = = 5 = 7 — Sec” © 
dx dx cos x& Cos? x COS? & 
d d = _ sin x 
— sec x = —(cosz)~! = —1(cosx)~?(—sinz) = 5 = Secxtanr 
dx dx COs? & 


The derivatives of the cotangent and cosecant are similar and left as exercises. 


Exercises 4.5. 


Find the derivatives of the following functions. 
1. sinxcosz > sin(cos xz) => 

3 eine = tanz/(1+sinz) > 

5. cotz > 

7. x sin(2327) > 

9. sin(cos(6x)) > 


csc xr => 


eo 


ees Py 
sin” x + cos’ x => 


sec 0 


10. C te —. ———_.. > 
ome de 1 + sec 6 
d 
11. Compute ae cos(6t). => 


d t® sin(3t) 
12. te — —__— . 
a a dt cos(2t) 


13. Find all points on the graph of f(x) = sin?(a) at which the tangent line is horizontal. > 


80 Chapter 4 Transcendental Functions 


14. Find all points on the graph of f(x) = 2sin(a)—sin?(«) at which the tangent line is horizontal. 
=> 


15. Find an equation for the tangent line to sin?(x) at « = 71/3. > 

16. Find an equation for the tangent line to sec? « at x = 7 13: = 

17. Find an equation for the tangent line to cos” # — sin?(4z) at = 7/6. > 

18. Find the points on the curve y = x+ 2cosz that have a horizontal tangent line. > 


19. Let C be a circle of radius r. Let A be an arc on C' subtending a central angle 0. Let B be 
the chord of C whose endpoints are the endpoints of A. (Hence, B also subtends 0.) Let 
s be the length of A and let d be the length of B. Sketch a diagram of the situation and 
compute lim s/d. 
60+ 


4.6 EXPONENTIAL AND LOGARITHMIC FUNCTIONS 


An exponential function has the form a”, where a is a constant; examples are 2”, 10”, e”. 
’ >] 5] Fy 


The logarithmic functions are the inverses of the exponential functions, that is, functions 
that “undo” the exponential functions, just as, for example, the cube root function “un- 
does” the cube function: V2? = 2. Note that the original function also undoes the inverse 
function: (V8)° = 8. 

Let f(x) = 2”. The inverse of this function is called the logarithm base 2, denoted 
log, (x) or (especially in computer science circles) lg(a). What does this really mean? The 
logarithm must undo the action of the exponential function, so for example it must be that 
lg(2°) = 3—starting with 3, the exponential function produces 2° = 8, and the logarithm 
of 8 must get us back to 3. A little thought shows that it is not a coincidence that lg(2°) 
simply gives the exponent—the exponent 7s the original value that we must get back to. 
In other words, the logarithm is the exponent. Remember this catchphrase, and what it 
means, and you won’t go wrong. (You do have to remember what it means. Like any 
good mnemonic, “the logarithm is the exponent” leaves out a lot of detail, like “Which 
exponent?” and “Exponent of what?” ) 


EXAMPLE 4.6.1 What is the value of log, (1000)? The “10” tells us the appropriate 
number to use for the base of the exponential function. The logarithm is the exponent, 
so the question is, what exponent E makes 10” = 1000? If we can find such an FE, then 
log9(1000) = log,)(10”) = E; finding the appropriate exponent is the same as finding the 


logarithm. In this case, of course, it is easy: E’ = 3 so log; (1000) = 3. 


Let’s review some laws of exponents and logarithms; let a be a positive number. Since 


a® =a-a-a-a-aand a? =a-a-a, it’s clear that a°-a® =a-a-a-a-a-a-a-a= a8 = a?t?, 


mtn 


and in general that aa” =a . Since “the logarithm is the exponent,” it’s no surprise 


that this translates directly into a fact about the logarithm function. Here are three facts 
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from the example: log,(a°) = 5, log,(a?) = 3, log,(a°®) = 8. So log,(a°a?) = log, (a®) = 
8 =5+3=log,(a°) + log,(a®). Now let’s make this a bit more general. Suppose A and 
B are two numbers, A = a*, and B = a". Then log,(AB) = log,(a7a") = log,(a*t¥) = 
x+y =log,(A) + log,(B). 

Now consider (a)? = a? - a®- a? = a?*°t? = a?? = a’. Again it’s clear that more 
generally (a’”)” = a"™", and again this gives us a fact about logarithms. If A = a” then 
AY = (a*)¥ = a®, so log, (A¥) = ry = ylog,(A)—the exponent can be “pulled out in 
front.” 

We have cheated a bit in the previous two paragraphs. It is obvious that a° = a-a-a-a-a 
and a® = a-a-a and that the rest of the example follows; likewise for the second example. 
But when we consider an exponential function a” we can’t be limited to substituting 
integers for x. What does a2? or a~!3 or a” mean? And is it really true that a?°a7!3 = 
a?°—1-32 The answer to the first question is actually quite difficult, so we will evade it; 
the answer to the second question is “yes.” 

We'll evade the full answer to the hard question, but we have to know something about 
exponential functions. You need first to understand that since it’s not “obvious” what 2” 
should mean, we are really free to make it mean whatever we want, so long as we keep the 
behavior that is obvious, namely, when « is a positive integer. What else do we want to 
be true about 2”? We want the properties of the previous two paragraphs to be true for 
all exponents: 272¥ = 27t¥ and (27)¥ = 27, 

After the positive integers, the next easiest number to understand is 0: 2° = 1. You 
have presumably learned this fact in the past; why is it true? It is true precisely because 
= 2° to be true about the function 2”. We need it to be true that 
2°92 — 2°+® — 9” and this only works if 2° = 1. The same argument implies that a? = 1 


we want 272° 


for any a. 

The next easiest set of numbers to understand is the negative integers: for example, 
2-3? = 1/2°. We know that whatever 2~* means it must be that 272° = 27379 = 2° = 1, 
which means that 2~? must be 1/2°. In fact, by the same argument, once we know what 
2” means for some value of x, 2~” must be 1/2” and more generally a~* = 1/a”. 

Next, consider an exponent 1/q, where q is a positive integer. We want it to be true 
that (27)¥ = 24, so (2'/2)4 = 2. This means that 21/7 is a q-th root of 2, 21/4 = W2. This 
is all we need to understand that 2?/4 = (21/9)P = (W/2 )P and a?/4 = (al/4)P = (Wa )?. 

What’s left is the hard part: what does 2” mean when x cannot be written as a 


fraction, like « = /2 or « = 7? What we know so far is how to assign meaning to 2” 
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whenever x = p/q; if we were to graph this we’d see something like this: 


But this is a poor picture, because you can’t see that the “curve” is really a whole lot 
of individual points, above the rational numbers on the x-axis. There are really a lot of 
“holes” in the curve, above x = 7, for example. But (this is the hard part) it is possible 
to prove that the holes can be “filled in”, and that the resulting function, called 2”, really 
does have the properties we want, namely that 272¥ = 277 and (27)¥ = 274, 


Exercises 4.6. 


1. Expand log,)((x + 45)" (a — 2)). 
3 
2. E d | 
xpand log, > 


ie 

3a —5+ (7/2) 

Write log, 32 + 17 log,(a — 2) — 2log, (x7 + 4a + 1) as a single logarithm. 
Solve log,(1+ Vz) = 6 for a. 

Solve 2” = 8 for x. 


Solve log, (log;(x)) = 1 for x. 


Sie ee 


4.7 DERIVATIVES OF THE EXPONENTIAL AND 
LOGARITHMIC FUNCTIONS 


As with the sine, we don’t know anything about derivatives that allows us to compute 
the derivatives of the exponential and logarithmic functions without going back to basics. 
Let’s do a little work with the definition again: 


d qztAr —qt 
—a* = lim ——— 
dx Ar—0 Ax 
: atqh® —qt 
= 1m 
Axz—-0 Ax 
Arc _ 1 
= lim a” 
Az—->0 Ax 
Ag _ 1 
=a” lim 
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There are two interesting things to note here: As in the case of the sine function we are left 
with a limit that involves Ax but not x, which means that whatever Jim (a —1)/Az 
t—> 


is, we know that it is a number, that is, a constant. This means that a” has a remarkable 
property: its derivative is a constant times itself. 
We earlier remarked that the hardest limit we would compute is lim, sinz/x = 1; we 
oe 


now have a limit that is just a bit too hard to include here. In fact the hard part is to 
see that Jim (a** — 1)/Az even exists—does this fraction really get closer and closer to 
I 


some fixed value? Yes it does, but we will not prove this fact. 

We can look at some examples. Consider (2 — 1)/ax for some small values of x: 1, 
0.828427124, 0.756828460, 0.724061864, 0.70838051, 0.70070877 when x is 1, 1/2, 1/4, 
1/8, 1/16, 1/32, respectively. It looks like this is settling in around 0.7, which turns out 
to be true (but the limit is not exactly 0.7). Consider next (3% — 1)/ax: 2, 1.464101616, 
1.264296052, 1.177621520, 1.13720773, 1.11768854, at the same values of x. It turns out 
to be true that in the limit this is about 1.1. Two examples don’t establish a pattern, but 
if you do more examples you will find that the limit varies directly with the value of a: 
bigger a, bigger limit; smaller a, smaller limit. As we can already see, some of these limits 
will be less than 1 and some larger than 1. Somewhere between a = 2 and a = 3 the limit 
will be exactly 1; the value at which this happens is called e, so that 

esr] 


lim 


i 
Azx—-0 Ax 


As you might guess from our two examples, e is closer to 3 than to 2, and in fact e = 2.718. 
Now we see that the function e” has a truly remarkable property: 


A 
d - ; ett x — et 
—_e = lim <=> 
dx Az—0 AG 
er eAt — et 
= hav 
Ax—-0 Ax 
Agr _ 1 
= lim e* 
Azx-0 Ax 
Agr _ 1 
=e" him 
Axz—-0 Ax 
= ev 


That is, e” is its own derivative, or in other words the slope of e* is the same as its height, 
or the same as its second coordinate: The function f(x) = e” goes through the point 
(z,e*) and has slope e* there, no matter what z is. It is sometimes convenient to express 
the function e* without an exponent, since complicated exponents can be hard to read. In 


such cases we use exp(), e.g., exp(1 + x”) instead of elt 
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What about the logarithm function? This too is hard, but as the cosine function was 
easier to do once the sine was done, so the logarithm is easier to do now that we know 
the derivative of the exponential function. Let’s start with log, x, which as you probably 
know is often abbreviated Inz and called the “natural logarithm” function. 

Consider the relationship between the two functions, namely, that they are inverses, 
that one “undoes” the other. Graphically this means that they have the same graph except 
that one is “flipped” or “reflected” through the line y = x, as shown in figure 4.7.1. 


Figure 4.7.1 The exponential and logarithm functions. 


This means that the slopes of these two functions are closely related as well: For example, 
the slope of e” is e at x = 1; at the corresponding point on the In(z) curve, the slope must 
be 1/e, because the “rise” and the “run” have been interchanged. Since the slope of e” is 
e at the point (1,e), the slope of In(z) is 1/e at the point (e, 1). 


Figure 4.7.2 Slope of the exponential and logarithm functions. 


More generally, we know that the slope of e” is e* at the point (z,e*), so the slope of 
In(a) is 1/e* at (e*, z), as indicated in figure 4.7.2. In other words, the slope of Inz is the 
reciprocal of the first coordinate at any point; this means that the slope of Inz at (x, Inz) 
is 1/x. The upshot is: 


—Inrz=-. 
x 


dx 
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We have discussed this from the point of view of the graphs, which is easy to understand 
but is not normally considered a rigorous proof—it is too easy to be led astray by pictures 
that seem reasonable but that miss some hard point. It is possible to do this derivation 
without resorting to pictures, and indeed we will see an alternate approach soon. 
Note that In x is defined only for x > 0. It is sometimes useful to consider the function 
In |x|, a function defined for z #0. When x < 0, In|z| = In(—2z) and 
d d 1 1 


re ln) = rT In(—x) = (-1) = 


Thus whether x is positive or negative, the derivative is the same. 

What about the functions a” and log, x? We know that the derivative of a” is some 
constant times a” itself, but what constant? Remember that “the logarithm is the expo- 
nent” and you will see that a = el™*. Then 


at = (en \" = erin? 
and we can compute the derivative using the chain rule: 


a By naye _ @ 
da” ~ dx dx 


elna _ (naje*** = (Ina)a”. 


The constant is simply Ina. Likewise we can compute the derivative of the logarithm 
function log, x. Since 


L=E 


we can take the logarithm base a of both sides to get 
log, (x) = log,(e™”) = Inz log, e. 


Then 


d 1 
— log. ¢ = = loge. 
gf 


dx 


This is a perfectly good answer, but we can improve it slightly. Since 


a= gina 


fae 


log, (a) = log, (e = Inalog, e 


1 = Inalog,e 


1 
Sees log, €, 
Ina 
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we can replace log, e to get 


d , re 
de ©" ~ aina’ 
You may if you wish memorize the formulas 
d d 1 
re =(Ina)a” and ce log, 2 = AR 


Because the “trick” a = e!™® is often useful, and sometimes essential, it may be better to 


remember the trick, not the formula. 


EXAMPLE 4.7.1 Compute the derivative of f(x) = 2”. 


d Oe pj 
= 9 ee n x 
dx ae ) 
-_ d xln2 
de 


= (=rmn2) grime 


= (no) S27 ie 


EXAMPLE 4.7.2 Compute the derivative of f(x) = 2” = 2"), 
d d 
2 —_ ae In 2 


dx dx 


= (2 In 2) ew In2 


=(2 ln 2)re”" we 


= (2In2)r2”" 


EXAMPLE 4.7.3. Compute the derivative of f(z) = x”. At first this appears to be 
a new kind of function: it is not a constant power of x, and it does not seem to be an 
exponential function, since the base is not constant. But in fact it is no harder than the 


previous example. 


4.8 Implicit Differentiation 87 


EXAMPLE 4.7.4 Recall that we have not justified the power rule except when the 
exponent is a positive or negative integer. We can use the exponential function to take 


care of other exponents. 


ad ri d rilngz 
dx ~—s dx 
d 
= (rine) gr ine 
1 Tr 
= (r—)z 
= ry"! 


Exercises 4.7. 


In 1-19, find the derivatives of the functions. 


19. 


20. 


21. 


4.8 


gt” _s 2 sin x 

er 
ey= 4. sin(e”) > 
sine _, 6. 2? = 
ee = 8. ©£+2° > 
(1/3)” => 10. eM /x > 
In(x® + 32) > 12. In(cos(x)) > 
af nie?) (a 14. In(sec(x) + tan(x)) > 
gos) 16. xclnz 

1+ In(3a? 
In(In(3z)) 18. ao 


a® (a — 23)1/? 
2726 (4x — 6)8 


Find the value of a so that the tangent line to y = In() at « = a is a line through the origin. 


Sketch the resulting situation. > 
If f(x) = n(x? + 2) compute f’(e'/?). 


IMPLICIT DIFFERENTIATION 


As we have seen, there is a close relationship between the derivatives of e” and In x because 


these functions are inverses. Rather than relying on pictures for our understanding, we 


would like to be able to exploit this relationship computationally. In fact this technique 


can help us find derivatives in many situations, not just when we seek the derivative of an 


inverse function. 
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We will begin by illustrating the technique to find what we already know, the derivative 
of Inz. Let’s write y = Ina and then « = e™® = e”, that is, c = e¥. We say that 
this equation defines the function y = Inzx implicitly because while it is not an explicit 
expression y = ..., it is true that if c = e” then y is in fact the natural logarithm function. 
Now, for the time being, pretend that all we know of y is that x = e”; what can we say 
about derivatives? We can take the derivative of both sides of the equation: 


Ge BAe ay 
dx dx 


Then using the chain rule on the right hand side: 
d 
ig— (=) e4 = ye. 


_ a! 
oS oy eae 


Then we can solve for y’: 


There is one little difficulty here. To use the chain rule to compute d/dz(e”) = y’e” we 
need to know that the function y has a derivative. All we have shown is that if it has a 
derivative then that derivative must be 1/x. When using this method we will always have 
to assume that the desired derivative exists, but fortunately this is a safe assumption for 
most such problems. 

The example y = In involved an inverse function defined implicitly, but other func- 
tions can be defined implicitly, and sometimes a single equation can be used to implicitly 
2 2g eae 
describes a circle of radius r. The circle is not a function y = f(a) because for some values 


define more than one function. Here’s a familiar example. The equation r 


of x there are two corresponding values of y. If we want to work with a function, we can 
break the circle into two pieces, the upper and lower semicircles, each of which is a function. 
Let’s call these y = U(x) and y = L(x); in fact this is a fairly simple example, and it’s 
possible to give explicit expressions for these: U(x) = Vr? — x? and L(x) = —Vr?-2?. 
But it’s somewhat easier, and quite useful, to view both functions as given implicitly by 
r? =a? +y": both r? = 2? 4+ U(x)? and r? = 2? + L(x)? are true, and we can think of 
r? =a? +y? as defining both U(x) and L(2). 

Now we can take the derivative of both sides as before, remembering that y is not 
simply a variable but a function—in this case, y is either U(x) or L(x) but we’re not yet 
specifying which one. When we take the derivative we just have to remember to apply the 
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chain rule where y appears. 


£? = S(t) 
0 = 2x + 2yy' 
f~S2e se 
rear 


Now we have an expression for y’, but it contains y as well as x. This means that if we 
want to compute y’ for some particular value of x we’ll have to know or compute y at that 
value of x as well. It is at this point that we will need to know whether y is U(x) or L(x). 
Occasionally it will turn out that we can avoid explicit use of U(x) or L(«) by the nature 
of the problem 


EXAMPLE 4.8.1 Find the slope of the circle 4 = x? +? at the point (1, —V3). Since 
we know both the x and y coordinates of the point of interest, we do not need to explicitly 
recognize that this point is on L(x), and we do not need to use L(x) to compute y—but 
we could. Using the calculation of y’ from above, 


; a 1 1 
Y = —_— = — — - 
y -V3 V3 
It is instructive to compare this approach to others. 


We might have recognized at the start that (1,—V/3) is on the function y = L(x) = 


— /4— 27. We could then take the derivative of L(x), using the power rule and the chain 
rule, to get 


x 
Jae 


Then we could compute L'(1) = 1/V3 by substituting x = 1. 
Alternately, we could realize that the point is on L(x), but use the fact that y/ = —a/y. 


(a) = —5(4— 2?) (-25) = 


Since the point is on L(x) we can replace y by L(x) to get 


; My ay 


EGY afte 


without computing the derivative of L(x) explicitly. Then we substitute z = 1 and get the 


same answer as before. 


In the case of the circle it is possible to find the functions U(x) and L(x) explicitly, but 
there are potential advantages to using implicit differentiation anyway. In some cases it is 
more difficult or impossible to find an explicit formula for y and implicit differentiation is 
the only way to find the derivative. 
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EXAMPLE 4.8.2 Find the derivative of any function defined implicitly by yx? +e¥ = x. 
We treat y as an unspecified function and use the chain rule: 


d d 
ae +e4) = im 
(y-2n+4' +27) +y'e% =1 
yz +y'e¥ =1—2ry 
y' (a* +e”) =1—-2Qry 
= Li 2a 
x? + e¥ 


You might think that the step in which we solve for y’ could sometimes be difficult— 
after all, we’re using implicit differentiation here because we can’t solve the equation 
yx? + e¥ =x for y, so maybe after taking the derivative we get something that is hard to 
solve for y’. In fact, this never happens. All occurrences y’ come from applying the chain 
rule, and whenever the chain rule is used it deposits a single y’ multiplied by some other 
expression. So it will always be possible to group the terms containing y’ together and 
factor out the y’, just as in the previous example. If you ever get anything more difficult 
you have made a mistake and should fix it before trying to continue. 

It is sometimes the case that a situation leads naturally to an equation that defines a 
function implicitly. 


EXAMPLE 4.8.3 Consider all the points (x, y) that have the property that the distance 
from (x,y) to (x1, y1) plus the distance from (x,y) to (2, y2) is 2a (a is some constant). 
These points form an ellipse, which like a circle is not a function but can viewed as two 
functions pasted together. Because we know how to write down the distance between two 
points, we can write down an implicit equation for the ellipse: 


J (a — £1)? + (y — yi)? + V (@ — 22)? + (y — 92)? = 2a. 


Then we can use implicit differentiation to find the slope of the ellipse at any point, though 


the computation is rather messy. 


EXAMPLE 4.8.4 We have already justified the power rule by using the exponential 
function, but we could also do it for rational exponents by using implicit differentiation. 


m/n 


Suppose that y = 2"’", where m and n are positive integers. We can write this implicitly 


as y” = x"”, then because we justified the power rule for integers, we can take the derivative 
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of each side: 


n 1 


ny” ty! = ma" 


; mam! 
QS ma 
n yr 
; m giml 
y = 


_ n (am/n)n—1 


iran Mm m—1—(m/n)(n-1) 


n 
! m 


y= — m—1—-m+(m/n) 


© 
| 


nm 
/ mM (m/n)—1 
m 


Exercises 4.8. 


In exercises 1-8, find a formula for the derivative y’ at the point (2, y): 
1. y” =l4+277s 

2. e+aeyt+y =i = 

3. z+ ay” =y + yx" => 

A. Acosxsiny=1=> 

5. Jat+/y=9> 

6. tan(z/y) =x“#+y> 

7. sin(g+y)=2y> 

8 

9 


1 1 
. 245575 
ey 


. A hyperbola passing through (8,6) consists of all points whose distance from the origin is a 
constant more than its distance from the point (5,2). Find the slope of the tangent line to 
the hyperbola at (8,6). > 


10. Compute y’ for the ellipse of example 4.8.3. 
11. If y =log, x then a¥ = x. Use implicit differentiation to find y’. 


12. The graph of the equation x” — ry +y” = 9 is an ellipse. Find the lines tangent to this curve 
at the two points where it intersects the z-axis. Show that these lines are parallel. > 


13. Repeat the previous problem for the points at which the ellipse intersects the y-axis. => 

14. Find the points on the ellipse from the previous two problems where the slope is horizontal 
and where it is vertical. > 

15. Find an equation for the tangent line to «* = y+.” at (2, V12). (This curve is the kampyle 
of Eudoxus.) => 

16. Find an equation for the tangent line to 2?/% + y?/° = a?’ at a point (1,41) on the curve, 
with x; #0 and y; #0. (This curve is an astroid.) > 


17. Find an equation for the tangent line to (x7 +y’)? = a” —y’ at a point (1, y1) on the curve, 
when yi # 0. (This curve is a lemniscate.) > 
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Definition. Two curves are orthogonal if at each point of intersection, the angle between 
their tangent lines is 7/2. Two families of curves, A and B, are orthogonal trajectories of each 
other if given any curve C in A and any curve D in 6 the curves C and D are orthogonal. For 
example, the family of horizontal lines in the plane is orthogonal to the family of vertical lines in 
the plane. 


18. Show that x? — y? = 5 is orthogonal to 4x” + 9y? = 72. (Hint: You need to find the 
intersection points of the two curves and then show that the product of the derivatives at 
each intersection point is —1.) 


19. Show that «7 +y? =r? is orthogonal to y = mz. Conclude that the family of circles centered 
at the origin is an orthogonal trajectory of the family of lines that pass through the origin. 


Note that there is a technical issue when m = 0. The circles fail to be differentiable 
when they cross the x-axis. However, the circles are orthogonal to the x-axis. Explain why. 
Likewise, the vertical line through the origin requires a separate argument. 


20. Fork £0 andc¥0 show that y” — 2” = k is orthogonal to yx = c. In the case where k and 
c are both zero, the curves intersect at the origin. Are the curves y” — x? = 0 and yx = 0 
orthogonal to each other? 


21. Suppose that m 4 0. Show that the family of curves {y = mx +b | b € R} is orthogonal to 
the family of curves {y = —(x/m)+c|ceéR}. 


4.9 INVERSE TRIGONOMETRIC FUNCTIONS 


The trigonometric functions frequently arise in problems, and often it is necessary to 
invert the functions, for example, to find an angle with a specified sine. Of course, there 
are many angles with the same sine, so the sine function doesn’t actually have an inverse 
that reliably “undoes” the sine function. If you know that sinx = 0.5, you can’t reverse 
this to discover x, that is, you can’t solve for x, as there are infinitely many angles with 
sine 0.5. Nevertheless, it is useful to have something like an inverse to the sine, however 
imperfect. The usual approach is to pick out some collection of angles that produce all 
possible values of the sine exactly once. If we “discard” all other angles, the resulting 
function does have a proper inverse. 

The sine takes on all values between —1 and 1 exactly once on the interval [—1/2, 7/2]. 
If we truncate the sine, keeping only the interval [—7/2, 7/2], as shown in figure 4.9.1, then 
this truncated sine has an inverse function. We call this the inverse sine or the arcsine, 
and write y = arcsin(x). 

Recall that a function and its inverse undo each other in either order, for example, 
(%/x)*? = x and V3 = x. This does not work with the sine and the “inverse sine” because 
the inverse sine is the inverse of the truncated sine function, not the real sine function. 
It is true that sin(arcsin(#)) = x, that is, the sine undoes the arcsine. It is not true that 
the arcsine undoes the sine, for example, sin(57/6) = 1/2 and arcsin(1/2) = 7/6, so doing 
first the sine then the arcsine does not get us back where we started. This is because 57/6 
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—20 —31/2 —1 —n/2 m/2 T 31/2 Qn 


nm /2 —1 1 


Lxr/2 4 
Figure 4.9.1 ‘The sine, the truncated sine, the inverse sine. 


is not in the domain of the truncated sine. If we start with an angle between —7/2 and 
7/2 then the arcsine does reverse the sine: sin(7/6) = 1/2 and arcsin(1/2) = 7/6. 

What is the derivative of the arcsine? Since this is an inverse function, we can discover 
the derivative by using implicit differentiation. Suppose y = arcsin(xz). Then 


sin(y) = sin(arcsin(x)) = x. 
Now taking the derivative of both sides, we get 


y’ cosy = 1 
; 1 


cos y 


As we expect when using implicit differentiation, y appears on the right hand side here. 
We would certainly prefer to have y’ written in terms of x, and as in the case of Inx we 
can actually do that here. Since sin? y + cos? y = 1, cos?y = 1—sin?y = 1— 27. So 
cosy = £1 — 2? , but which is it—plus or minus? It could in general be either, but this 
isn’t “in general”: since y = arcsin(x) we know that —7/2 < y < 7/2, and the cosine of 


an angle in this interval is always positive. Thus cosy = V1— a? and 


1 


Note that this agrees with figure 4.9.1: the graph of the arcsine has positive slope every- 


ae arcsin(xz) = 


where. 
We can do something similar for the cosine. As with the sine, we must first truncate 
the cosine so that it can be inverted, as shown in figure 4.9.2. Then we use implicit 
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T 4 


-1 i 


Figure 4.9.2 The truncated cosine, the inverse cosine. 
differentiation to find that 


V1l— a? 
Note that the truncated cosine uses a different interval than the truncated sine, so that if 
y = arccos(x) we know that 0 < y < 7. The computation of the derivative of the arccosine 
is left as an exercise. 

Finally we look at the tangent; the other trigonometric functions also have “partial 
inverses” but the sine, cosine and tangent are enough for most purposes. The tangent, 
truncated tangent and inverse tangent are shown in figure 4.9.3; the derivative of the 


arctangent is left as an exercise. 


m/2 
ei a 7 ee of 
—n/2 


Figure 4.9.3 The tangent, the truncated tangent, the inverse tangent. 


Exercises 4.9. 


1. Show that the derivative of arccos zx is — 


1 


2. Show that the derivative of arctan z is : 
1+ 2? 
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3. The inverse of cot is usually defined so that the range of arccot is (0,7). Sketch the graph 
of y = arccot x. In the process you will make it clear what the domain of arccot is. Find the 
derivative of the arccotangent. > 


. Show that arccot x + arctanxz = 7/2. 


. Find the derivative of arcsin(x?). > 


4 
5 
6. Find the derivative of arctan(e”). > 
7. Find the derivative of arccos(sin 2°) > 
8. Find the derivative of In((arcsin x)”) > 
9. Find the derivative of arccose” => 
10. Find the derivative of arcsin x + arccos xz => 


11. Find the derivative of log;(arctan(x”)) > 


4.10 LIMITS REVISITED 


We have defined and used the concept of limit, primarily in our development of the deriva- 
tive. Recall that lim f(a) = L is true if, in a precise sense, f(a) gets closer and closer to 
La 


L as x gets closer and closer to a. While some limits are easy to see, others take some 
ingenuity; in particular, the limits that define derivatives are always difficult on their face, 


since in 
sin fet Ae) = F(a) 
Az—>0 Ax 


both the numerator and denominator approach zero. Typically this difficulty can be re- 
solved when f is a “nice” function and we are trying to compute a derivative. Occasionally 
such limits are interesting for other reasons, and the limit of a fraction in which both nu- 
merator and denominator approach zero can be difficult to analyze. Now that we have 
the derivative available, there is another technique that can sometimes be helpful in such 
circumstances. 

Before we introduce the technique, we will also expand our concept of limit, in two 
ways. When the limit of f(x) as x approaches a does not exist, it may be useful to note in 
what way it does not exist. We have already talked about one such case: one-sided limits. 
Another case is when “f goes to infinity”. We also will occasionally want to know what 
happens to f when x “goes to infinity”. 


EXAMPLE 4.10.1 What happens to 1/xz as x goes to 0? From the right, 1/2 gets 
bigger and bigger, or goes to infinity. From the left it goes to negative infinity. 


EXAMPLE 4.10.2 What happens to the function cos(1/xz) as x goes to infinity? It 
seems clear that as x gets larger and larger, 1/x gets closer and closer to zero, so cos(1/z) 


should be getting closer and closer to cos(0) = 1. 
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As with ordinary limits, these concepts can be made precise. Roughly, we want 
lim f(x) = co to mean that we can make f(x) arbitrarily large by making x close enough 
wa 


to a, and lim f(«) = L should mean we can make f(x) as close as we want to L by 
xL—- Oo 


making x large enough. Compare this definition to the definition of limit in section 2.3, 
definition 2.3.2. 


DEFINITION 4.10.3 If f is a function, we say that lim f(z) = oo if for every N > 0 
wa 


there is a 6 > 0 such that whenever |x — a] < 6, f(z) > N. We can extend this in the 
obvious ways to define lim f(x) = —oo, lim f(x) = too, and lim, Fe = 00: 
La La La 


DEFINITION 4.10.4 Limit at infinity If f isa function, we say that lim f(x) = L 
LOO 


if for every € > 0 there is an N > 0 so that whenever x > N, |f(x) — L| < «. We may 
similarly define lim f(x) = ZL, and using the idea of the previous definition, we may 
«L—-—Co 


define jim Fie) = 200. 


We include these definitions for completeness, but we will not explore them in detail. 
Suffice it to say that such limits behave in much the same way that ordinary limits do; in 
particular there are some analogs of theorem 2.3.6. 

Now consider this limit: 


zon sina 


As x approaches 7, both the numerator and denominator approach zero, so it is not 
obvious what, if anything, the quotient approaches. We can often compute such limits by 
application of the following theorem. 


THEOREM 4.10.5 L’H6pital’s Rule — For “sufficiently nice” functions f(x) and 
g(x), if lim f(x) = 0 = lim g(z) or both lim f(x) = too and lim,-44 g(x) = +o, and if 


/ / 
oe 
i cs) exists, then lim fl) = lim mt ) This remains true if “a — a” is replaced by 
“% — co” or “x + —0o”. = 


This theorem is somewhat difficult to prove, in part because it incorporates so many 
different possibilities, so we will not prove it here. We also will not need to worry about 
the precise definition of “sufficiently nice”, as the functions we encounter will be suitable. 

pa 
EXAMPLE 4.10.6 Compute lim 


r>r 6SiInav 


in two ways. 
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First we use L’Hopital’s Rule: Since the numerator and denominator both approach 
zero, 


: 2K. 
= lim r 
rm COS XL 


ron singe 
provided the latter exists. But in fact this is an easy limit, since the denominator now 
approaches —1, so 


- = —27. 
rrr SIND —] 


—T 
lim (2 + 7)— 
LOT sin v 
and note that 
_ “on LT 
lim — = | - = lim —— 
con sing «onr—sin(g—m) 230 sing 


since x — 7 approaches zero as x approaches 7. Now 


lim (a + t) " = lim (x +7) lim — iat Qn(—1) = —20 
w>T sin v wT z—->0 sinzx 


as before. 


Qn? — 32 4+7 
EXAMPER A107 “Computettin. = | oan: 
so g2+47r4+1 


As x goes to infinity both the numerator and denominator go to infinity, so we may 
apply L’Hopital’s Rule: 


Qn? — 3a +7 
lim = 


4r — 3 
e300 22 +474 +1 © 


ios PAT 


In the second quotient, it is still the case that the numerator and denominator both go to 
infinity, so we are allowed to use L’H6pital’s Rule again: 


4e-3 _ 4 9 


So the original limit is 2 as well. 


Again, we don’t really need L’Ho6pital’s Rule, and in fact a more elementary approach 
is easier—we divide the numerator and denominator by «7: 


Qn? — 382 +7 Qe? -38r+7a— |. 2-343 
im >, — = 1m = 
woo x2 4+ 47¢+1 ero a? +474 +14 


Now as x approaches infinity, all the quotients with some power of x in the denominator 


approach zero, leaving 2 in the numerator and 1 in the denominator, so the limit again is 
2: 
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EXAMPLE 4.10.8 Compute lim pee >t. 
z>0 sing 


Both the numerator and denominator approach zero, so applying L’Hopital’s Rule: 


seca — 1 _ secxtanx 1-0 


x>0 sing z>0 cosx 1 


EXAMPLE 4.10.9 Compute lim a Ine. 
xz—0 


This doesn’t appear to be suitable for L’H6pital’s Rule, but it also is not “obvious”. 
As «x approaches zero, Inz goes to —oo, so the product looks like (something very small) - 
(something very large and negative). But this could be anything: it depends on how small 
and how large. For example, consider (x*)(1/2), (a)(1/a), and (x)(1/x”). As x approaches 
zero, each of these is (something very small) - (something very large), yet the limits are 
respectively zero, 1, and oo. 

We can in fact turn this into a L’Hopital’s Rule problem: 


Ina _ Ina 


ic 
ea jg oat 


Now as x approaches zero, both the numerator and denominator approach infinity (one 
—oo and one +00, but only the size is important). Using L’H6pital’s Rule: 
Inz ; lig 


lim —; = lim = lim —(—2z”) = lim —r =0. 
20+ DT 20+ —27 z—0t & x—0+ 


One way to interpret this is that since lim, xlnax = 0, the x approaches zero much faster 
xz—0 


than the In xz approaches —oo. 


Exercises 4.10. 


Compute the limits. 


1. lim et = 2. lim => 
xz—0 sin x zZ—co 
: . Ina 
3. lim Je2+2—-V22-2 > 4. lim — > 
woo zoo 
5. lim — = 6. li lbs 
xr 0o x woo EF — Et 
da So. Tig, AO ng 
20 x toit t?—2t4+1 
= _ = 3 
GT ee ce ie ae 
72 4-2 too 3t+12—1/t? 
1 = _ 
ii: WEEE te 1s te 


yoo UT] xl ye —1 


13. 


15. 


17. 


19. 


21. 


23. 


25. 


27. 


29. 


31. 


33. 


35. 


37. 


39. 


Al. 


43. 


45. 


AT. 


49. 


1 1 
li —-4—)(Vt+1-1ls> 
tin, (G+ Ze) . ) 
(u—1)° 
] 
ui (1/u) —u2+3u—3 


lim Pe IVe => 
“2—Ot 241//Sx 


de 
oo 2/3 + q/4 


t 
: WS t—1 
lim ——— => 
too 1 — t 
t—1 
COS x 


li — 
27/2 (1/2) —a 


xz—0 x 
sin(22) 
im —-— 
xr—0 In(a + 1) 
lim ve 
zit x 1 


ogee 


=> 


lim 2te 
xLZ—-> co 1 + Qa71 


1 41 
; 
acy a (e+s5) = 


x? — 6x —2 


14. 


18. 


20. 


22. 


24. 
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t—0 


Him (+ $) (4 t)/? 8) 


a? 


lim ————_ = 
20 /ig-F1—1 

a a (1) 

] ao Ns 
z+) 3 — (2/2) 


lim 34a +a" 
ez 3Ot+ 2 + 4-1/2 


Vo 


i xing 
x21 2 — 1 
grea 


: +a? 
lim —— 
LZ Co 2x + med 

Ax 


1 1 
li 5 aad 
tim + 8)(35+ 55) 


x? — 6x —2 
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x 
50. The function f(x) = has two horizontal asymptotes. Find them and give a rough 
eo? 1 


sketch of f with its horizontal asymptotes. => 


4.11 HYPERBOLIC FUNCTIONS 


The hyperbolic functions appear with some frequency in applications, and are quite similar 
in many respects to the trigonometric functions. This is a bit surprising given our initial 
definitions. 


DEFINITION 4.11.1 The hyperbolic cosine is the function 


2 
and the hyperbolic sine is the function 
zo _ e—# 
sinh xz = 
2 


Notice that cosh is even (that is, cosh(—ax) = cosh(a)) while sinh is odd (sinh(—x) = 
—sinh(z)), and cosha + sinha = e*. Also, for all x, cosha > 0, while sinha = 0 if and 
only if e” — e-* =0, which is true precisely when x = 0. 


LEMMA 4.11.2 The range of cosh is [1, 00). 


Proof. Let y=coshz. We solve for a: 


ere 
an, 

2y=e*+e” 
Qye* = e7* +1 


fav) 
8 
| 
< 
H- 
s NO 
| 
—_ 


From the last equation, we see y? > 1, and since y > 0, it follows that y > 1. 


Now suppose y > 1, so yy? —1>0. Then x = In(y+ Vy? — 1) is a real number, 
and y = coshz, so y is in the range of cosh(z). = 
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DEFINITION 4.11.3 The other hyperbolic functions are 


sinh x 
tanhz = 
cosh x 
cosh x 
cotha = — 
sinh x 
1 
sech az = ——— 
cosh x 
1 
cschz = 
sinh x 


The domain of coth and csch is x 4 0 while the domain of the other hyperbolic functions 


is all real numbers. Graphs are shown in figure 4.11.1 


4 4 4 5 
3 
2 2 2 
—2-10 1 2 = 2 -2 
—4' —4\J 


Figure 4.11.1 The hyperbolic functions: cosh, sinh, tanh, sech, csch, coth. 


Certainly the hyperbolic functions do not closely resemble the trigonometric functions 
graphically. But they do have analogous properties, beginning with the following identity. 


THEOREM 4.11.4 For all x in R, cosh? x — sinh? x = 1. 


Proof. The proof is a straightforward computation: 


5 ; ‘ (e7 he ee (e* _ me @ e2 4. 9 e72e _ e2 a e72z 4 
_ = a = = = 1. 
cosh* «—sinh* x r z A 1 


This immediately gives two additional identities: 
1 — tanh? x = sech? x and coth? x — 1 = csch? 2. 


The identity of the theorem also helps to provide a geometric motivation. Recall that 
the graph of x? — y? = 1 is a hyperbola with asymptotes x = ty whose x-intercepts are 
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+1. If (x,y) is a point on the right half of the hyperbola, and if we let x = cosht, then 


y = +V 22 —1 = +Vcosh?t—1 = +sinht. So for some suitable t, cosht and sinht are 
the coordinates of a typical point on the hyperbola. In fact, it turns out that t is twice 
the area shown in the first graph of figure 4.11.2. Even this is analogous to trigonometry; 
cost and sint are the coordinates of a typical point on the unit circle, and t is twice the 
area shown in the second graph of figure 4.11.2. 


(cos t, sin t) 


(cosh t, sinh t) May 


Figure 4.11.2 Geometric definitions of sin, cos, sinh, cosh: t is twice the shaded area in 
each figure. 


Given the definitions of the hyperbolic functions, finding their derivatives is straight- 


forward. Here again we see similarities to the trigonometric functions. 


THEOREM 4.11.5 7 cosh x = sinhz and be sinh x = cosh. 
ay v 


d d x =i 2 go d d Z_ pt 
Proof. in cosh x = in c = mn > = sinh z, and a sinha = —— = 
—— = cosh z. = 


Since cosha > 0, sinh is increasing and hence injective, so sinhxz has an inverse, 
arcsinh z. Also, sinha > 0 when x > 0, so cosh is injective on [0, 00) and has a (partial) 
inverse, arccosh x. The other hyperbolic functions have inverses as well, though arcsech x 
is only a partial inverse. We may compute the derivatives of these functions as we have 


other inverse functions. 


il 


d 
THEOREM 4.11.6 —arcsinhx = ———. 
di vive 


d 
Proof. Let y = arcsinhz, so sinhy = x. Then qq Sinhy = cosh(y)- y’ = 1, and so 
£ 
1 1 


1 
~ coshy J/1+sinh? y fifa? 


/ 
| 
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The other derivatives are left to the exercises. 


Exercises 4.11. 


1. 


Show that the range of sinh is all real numbers. (Hint: show that if y = sinha then 


t= In(y+ Vy? + 1).) 


. Compute the following limits: 


a. lim coshz 
woo 


b. lim sinhz 


xw—-Co 


c. lim tanhz 
woo 


d. lim (cosh — sinh z) 
xwL—-Co 


. Show that the range of tanhz is (—1,1). What are the ranges of coth, sech, and csch? (Use 


the fact that they are reciprocal functions.) 


. Prove that for every x,y € R, sinh(x + y) = sinhacoshy + coshasinhy. Obtain a similar 


identity for sinh(x — y). 


. Prove that for every x,y € R, cosh(x + y) = coshacoshy + sinhasinhy. Obtain a similar 


identity for cosh(x — y). 


. Use exercises 4 and 5 to show that sinh(2x) = 2sinh a cosh and cosh(2a) = cosh” x+sinh? x 


for every x. Conclude also that (cosh(2x) — 1)/2 = sinh” «. 


d 
. Show that qq (tanh x) = sech? x. Compute the derivatives of the remaining hyperbolic 


functions as well. 


. What are the domains of the six inverse hyperbolic functions? 


. Sketch the graphs of all six inverse hyperbolic functions. 


o 


Curve Sketching 


Whether we are interested in a function as a purely mathematical object or in connection 
with some application to the real world, it is often useful to know what the graph of 
the function looks like. We can obtain a good picture of the graph using certain crucial 
information provided by derivatives of the function and certain limits. 


5.1 MAXIMA AND MINIMA 


A local maximum point on a function is a point (z,y) on the graph of the function 
whose y coordinate is larger than all other y coordinates on the graph at points “close 
to” (x,y). More precisely, (x, f(x)) is a local maximum if there is an interval (a,b) with 
a<a«<band f(x) > f(z) for every z in (a,b). Similarly, (z,y) is a local minimum 
point if it has locally the smallest y coordinate. Again being more precise: (x, f(x)) is a 
local minimum if there is an interval (a,b) with a < x < band f(x) < f(z) for every z in 
(a,b). A local extremum is either a local minimum or a local maximum. 

Local maximum and minimum points are quite distinctive on the graph of a function, 
and are therefore useful in understanding the shape of the graph. In many applied problems 
we want to find the largest or smallest value that a function achieves (for example, we might 
want to find the minimum cost at which some task can be performed) and so identifying 
maximum and minimum points will be useful for applied problems as well. Some examples 
of local maximum and minimum points are shown in figure 5.1.1. 

If (x, f(x)) is a point where f(a) reaches a local maximum or minimum, and if the 
derivative of f exists at x, then the graph has a tangent line and the tangent line must be 
horizontal. This is important enough to state as a theorem, though we will not prove it. 
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Figure 5.1.1 Some local maximum points (A) and minimum points (B). 


THEOREM 5.1.1 Fermat’s Theorem If f(x) has a local extremum at x = a and 
f is differentiable at a, then f’(a) = 0. rT 


Thus, the only points at which a function can have a local maximum or minimum 
are points at which the derivative is zero, as in the left hand graph in figure 5.1.1, or the 
derivative is undefined, as in the right hand graph. Any value of x for which f’(zx) is zero or 
undefined is called a critical value for f. When looking for local maximum and minimum 
points, you are likely to make two sorts of mistakes: You may forget that a maximum or 
minimum can occur where the derivative does not exist, and so forget to check whether 
the derivative exists everywhere. You might also assume that any place that the derivative 
is zero is a local maximum or minimum point, but this is not true. A portion of the graph 
of f(x) = 2° is shown in figure 5.1.2. The derivative of f is f’(x) = 3a?, and f’(0) = 0, 


but there is neither a maximum nor minimum at (0,0). 


Figure 5.1.2 No maximum or minimum even though the derivative is zero. 


Since the derivative is zero or undefined at both local maximum and local minimum 
points, we need a way to determine which, if either, actually occurs. The most elementary 
approach, but one that is often tedious or difficult, is to test directly whether the y coor- 


dinates “near” the potential maximum or minimum are above or below the y coordinate 
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at the point of interest. Of course, there are too many points “near” the point to test, but 
a little thought shows we need only test two provided we know that f is continuous (recall 
that this means that the graph of f has no jumps or gaps). 

Suppose, for example, that we have identified three points at which f’ is zero or 
nonexistent: (71,41), (2,2), (%3, y3), and 71 < xq < x3 (see figure 5.1.3). Suppose that 
we compute the value of f(a) for 71 < a < 22, and that f(a) < f(g). What can we say 
about the graph between a and x2? Could there be a point (b, f(b)), a < 6 < x2 with 
f(b) > f(v2)? No: if there were, the graph would go up from (a, f(a)) to (b, f(b)) then 
down to (x2, f(x2)) and somewhere in between would have a local maximum point. (This 
is not obvious; it is a result of the Extreme Value Theorem, theorem 6.1.2.) But at that 
local maximum point the derivative of f would be zero or nonexistent, yet we already 
know that the derivative is zero or nonexistent only at 71, %2, and x3. The upshot is that 
one computation tells us that (x2, f(#2)) has the largest y coordinate of any point on the 
graph near x2 and to the left of x2. We can perform the same test on the right. If we find 
that on both sides of 72 the values are smaller, then there must be a local maximum at 
(x2, f(x2)); if we find that on both sides of x the values are larger, then there must be a 
local minimum at (x2, f(x2)); if we find one of each, then there is neither a local maximum 


or minimum at Zo. 


ria 6b Z2 x3 


Figure 5.1.3 Testing for a maximum or minimum. 


It is not always easy to compute the value of a function at a particular point. The 
task is made easier by the availability of calculators and computers, but they have their 
own drawbacks—they do not always allow us to distinguish between values that are very 
close together. Nevertheless, because this method is conceptually simple and sometimes 
easy to perform, you should always consider it. 


EXAMPLE 5.1.2 Find all local maximum and minimum points for the function f(x) = 
x? —a. The derivative is f(x) = 327 — 1. This is defined everywhere and is zero at 
z = +/3/3. Looking first at « = 3/3, we see that f(/3/3) = —2V3/9. Now we test 
two points on either side of « = V3/ 3, making sure that neither is farther away than 
the nearest critical value; since V3 < 3, V3/3 < 1 and we can use x = 0 and x = 1. 
Since f(0) = 0 > —2V3/9 and f(1) = 0 > —2V3/9, there must be a local minimum at 
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a = V3/3. For « = —V3/3, we see that f(—V3/3) = 2V3/9. This time we can use « = 0 
and « = —1, and we find that f(—1) = f(0) = 0 < 2V3/9, so there must be a local 
maximum at 2 = —V3/3. 


Of course this example is made very simple by our choice of points to test, namely 
x = —1, 0, 1. We could have used other values, say —5/4, 1/3, and 3/4, but this would 
have made the calculations considerably more tedious. 


EXAMPLE 5.1.3 Find all local maximum and minimum points for f(x) = sinx+cosz. 
The derivative is f’(~) = cosx — sina. This is always defined and is zero whenever 
cosx = sinz. Recalling that the cosx and sin x are the x and y coordinates of points on a 
unit circle, we see that cosx = sina when z is 7/4, 7/447, 7/4427, 7/4437, etc. Since 
both sine and cosine have a period of 27, we need only determine the status of « = 7/4 
and x = 57/4. We can use 0 and 7/2 to test the critical value x = 7/4. We find that 
f(a/4) = V2, f(0) = 1 < V2 and f(/2) = 1, so there is a local maximum when x = 77/4 
and also when x = 7/4427, 7/4+44r, etc. We can summarize this more neatly by saying 
that there are local maxima at 7/4 + 2kz for every integer k. 

We use 7 and 27 to test the critical value « = 57/4. The relevant values are f(57/4) = 
—V2, f(r) = -1 > —Vv2, f(2r) = 1 > —V2, so there is a local minimum at x = 51/4, 
50/4427, 507/4+4n, etc. More succinctly, there are local minima at 57/4+2kz7 for every 


integer k. 


Exercises 5.1. 


In problems 1-12, find all local maximum and minimum points (z, y) by the method of this section. 


1. y=ur?-24> 2. y=24+3ce-2°5 
3. y=u —9x? +240 => 4 y=u —22°4+35 
5. y=3a*— 42° > 6. y=(a’-1)/4> 
7. y=32" — (1/27) 3 8. y=cos(2x) —2# > 
r-3 @<3 
9. fla)= {257 a > 10. f(x) =< x 3<a<5s5 
* ee Le 25 
11. f(x) =2° -984+4> 12. Fa) ={ jes ion = 
jx 20 


13. For any real number z there is a unique integer n such that n < x < n+1, and the greatest 
integer function is defined as |x| = n. Where are the critical values of the greatest integer 
function? Which are local maxima and which are local minima? 


14. Explain why the function f(x) = 1/z has no local maxima or minima. 


15. How many critical points can a quadratic polynomial function have? > 
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16. Show that a cubic polynomial can have at most two critical points. Give examples to show 
that a cubic polynomial can have zero, one, or two critical points. 


17. Explore the family of functions f(2) = 2° + ca +1 where c is a constant. How many and 
what types of local extremes are there? Your answer should depend on the value of c, that 
is, different values of c will give different answers. 


18. We generalize the preceding two questions. Let n be a positive integer and let f be a poly- 
nomial of degree n. How many critical points can f have? (Hint: Recall the Fundamental 
Theorem of Algebra, which says that a polynomial of degree n has at most n roots.) 


5.2 THE FIRST DERIVATIVE TEST 


The method of the previous section for deciding whether there is a local maximum or 
minimum at a critical value is not always convenient. We can instead use information 
about the derivative f’(x) to decide; since we have already had to compute the derivative 
to find the critical values, there is often relatively little extra work involved in this method. 

How can the derivative tell us whether there is a maximum, minimum, or neither at 
a point? Suppose that f’(a) = 0. If there is a local maximum when z = a, the function 
must be lower near x = a than it is right at x = a. If the derivative exists near x = a, this 
means f’(x) > 0 when z is near a and x < a, because the function must “slope up” just 
to the left of a. Similarly, f’(2) < 0 when =z is near a and x > a, because f slopes down 
from the local maximum as we move to the right. Using the same reasoning, if there is 
a local minimum at x = a, the derivative of f must be negative just to the left of a and 
positive just to the right. If the derivative exists near a but does not change from positive 
to negative or negative to positive, that is, it is positive on both sides or negative on both 
sides, then there is neither a maximum nor minimum when zx = a. See the first graph in 


figure 5.1.1 and the graph in figure 5.1.2 for examples. 


EXAMPLE 5.2.1 Find all local maximum and minimum points for f(x) = sina+cosx 
using the first derivative test. The derivative is f’(x) = cosx—sin x and from example 5.1.3 
the critical values we need to consider are 7/4 and 57/4. 

The graphs of sinz and cos are shown in figure 5.2.1. Just to the left of 7/4 the 
cosine is larger than the sine, so f’(x) is positive; just to the right the cosine is smaller 
than the sine, so f’(x) is negative. This means there is a local maximum at 7/4. Just to 
the left of 57/4 the cosine is smaller than the sine, and to the right the cosine is larger 
than the sine. This means that the derivative f’(x) is negative to the left and positive to 


the right, so f has a local minimum at 57/4. 


Exercises 5.2. 


In 1-18, find all critical points and identify them as local maximum points, local minimum points, 
or neither. 
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Se Se 


11. 


13. 


14. 


15. 


16. 


17. 


Figure 5.2.1 The sine and cosine. 


you -2> 2. y=243¢-2° > 

y= —9a* + 240 > A, pe Si a 

— 6. y=(@-V/es 

y= 377 — (1/2) = 8. y=cos(2xz) —2# > 
f(2)=6—2)/@+2) > 10. f(x) = |x? —121| > 
f(x)=2°/(@+1I => 12. flay={ Pantie) ae 


f(x) =sin? sx > 


Find the maxima and minima of f(a) = secx. > 


Let f(@) = cos”(0)—2sin(0). Find the intervals where f is increasing and the intervals where 
f is decreasing in [0,27]. Use this information to classify the critical points of f as either 
local maximums, local minimums, or neither. = 


Let r > 0. Find the local maxima and minima of the function f(x) = Vr? — x? on its 
domain [—r,r]. 


Let f(x) = ax” + ba +c with a # 0. Show that f has exactly one critical point. Give 
conditions on a and b which guarantee that the critical point will be a maximum. It is 
possible to see this without using calculus at all; explain. 


5.3. THE SECOND DERIVATIVE TEST 


The basis of the first derivative test is that if the derivative changes from positive to 


negative at a point at which the derivative is zero then there is a local maximum at the 


point, and similarly for a local minimum. If f’ changes from positive to negative it is 


decreasing; this means that the derivative of f’, f’, might be negative, and if in fact f” 


is negative then f’ is definitely decreasing, so there is a local maximum at the point in 


question. Note well that f’ might change from positive to negative while f” is zero, in 


which case f” gives us no information about the critical value. Similarly, if f’ changes 


from negative to positive there is a local minimum at the point, and f’ is increasing. If 


f” > 0 at the point, this tells us that f’ is increasing, and so there is a local minimum. 
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EXAMPLE 5.3.1 Consider again f(x) = sing + cosz, with f’(x) = cosx — sinx and 
f(x) = —sinx — cosa. Since f’(/4) = —V2/2 — /2/2 = —V2 <0, we know there is a 
local maximum at 1/4. Since f’”"(57/4) = — — V2/2 — —V2/2 = V2 > 0, there is a local 


minimum at 57/4. 


When it works, the second derivative test is often the easiest way to identify local max- 
imum and minimum points. Sometimes the test fails, and sometimes the second derivative 


is quite difficult to evaluate; in such cases we must fall back on one of the previous tests. 


EXAMPLE 5.3.2 Let f(x) = x*. The derivatives are f’(x) = 4x? and f” (x) = 122”. 
Zero is the only critical value, but f’’(0) = 0, so the second derivative test tells us nothing. 
However, f(x) is positive everywhere except at zero, so clearly f(x) has a local minimum 
at, zero. On the other hand, f(x) = —2* also has zero as its only critical value, and the 


second derivative is again zero, but —x* has a local maximum at zero. 


Exercises 5.3. 


Find all local maximum and minimum points by the second derivative test, when possible. 


1. y=ur?-2> 2. y=24+3ce-2°5 
y= a? — 92? 4240 > A. y=au'—227 43> 
5. y=3a*-— 42° > 6. y=(a* —1)/e=> 
7. y= 32" —(1/z") => 8. y=cos(2z)-r > 
9 y=4r4+V1l-r4> 10. y= (£4 1)/V502? +35 => 
11. y=r-4> 12. y=6r+sin3¢ > 
13;. g=<e+1/e> 14, y=2?4+1/e> 
15. y=(e+5)'4 = 16. y=tan?« > 
17. y=cos’a—sin’?s > 18. y=sin?s > 


5.4 CONCAVITY AND INFLECTION POINTS 


We know that the sign of the derivative tells us whether a function is increasing or decreas- 
ing; for example, when f’(x) > 0, f(x) is increasing. The sign of the second derivative 
f(x) tells us whether f’ is increasing or decreasing; we have seen that if f’ is zero and 
increasing at a point then there is a local minimum at the point, and if f’ is zero and 
decreasing at a point then there is a local maximum at the point. Thus, we extracted 
information about f from information about f”. 

We can get information from the sign of f’ even when f’ is not zero. Suppose that 
f(a) > 0. This means that near x = a, f’ is increasing. If f’(a) > 0, this means that f 
slopes up and is getting steeper; if f’(a) < 0, this means that f slopes down and is getting 


112 Chapter 5 Curve Sketching 


a a 


Figure 5.4.1 f(a) > 0: f’(a) positive and increasing, f’(a) negative and increasing. 


less steep. The two situations are shown in figure 5.4.1. A curve that is shaped like this 
is called concave up. 

Now suppose that f(a) < 0. This means that near « = a, f’ is decreasing. If 
f'(a) > 0, this means that f slopes up and is getting less steep; if f’(a) < 0, this means 
that f slopes down and is getting steeper. The two situations are shown in figure 5.4.2. A 
curve that is shaped like this is called concave down. 


a a 


Figure 5.4.2 f(a) <0: f'(a) positive and decreasing, f’(a) negative and decreasing. 


If we are trying to understand the shape of the graph of a function, knowing where 
it is concave up and concave down helps us to get a more accurate picture. Of particular 
interest are points at which the concavity changes from up to down or down to up; such 
points are called inflection points. If the concavity changes from up to down at x = a, f” 
changes from positive to the left of a to negative to the right of a, and usually f”(a) = 0. 
We can identify such points by first finding where f’’(x) is zero and then checking to see 
whether f”(a) does in fact go from positive to negative or negative to positive at these 
points. Note that it is possible that f’”(a) = 0 but the concavity is the same on both sides; 


f(z) = x* at x = 0 is an example. 


EXAMPLE 5.4.1 Describe the concavity of f(x) = x°—2«. f'(x) = 3x?—-1, f" (x) = 62. 
Since f”(0) = 0, there is potentially an inflection point at zero. Since f”(x) > 0 when 
x >Oand f”(x) < 0 when x < 0 the concavity does change from down to up at zero, and 


the curve is concave down for all x < 0 and concave up for all x > 0. 


Note that we need to compute and analyze the second derivative to understand con- 
cavity, so we may as well try to use the second derivative test for maxima and minima. If 
for some reason this fails we can then try one of the other tests. 
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Exercises 5.4. 


Describe the concavity of the functions in 1-18. 


1 y=2?-2> 2. y=243¢-2°5 
y=u? — 9274240 => 4A. y=au'—22° 435 

5B. y=3e°—42° = 6. y=(e’-1)/e> 

7. y=32 —(1/z") = 8. y=sinx+cosz > 

9 y=4r4+vV1l-2r4> 10. y= (24 1)/V502? +35 => 

11. y=r-«4> 12. y=6r+sin3z > 

13. y=r+1/¢e¢> 14, y=x? 4+1/e> 

15. y=(e4+5)'4> 16. y=tan?« > 

17. y=cos’2—-sin?« > 18. y=sin?s > 


19. Identify the intervals on which the graph of the function f(x) = x! — 4x® + 10 is of one of 
these four shapes: concave up and increasing; concave up and decreasing; concave down and 
increasing; concave down and decreasing. => 


20. Describe the concavity of y = x® + ba? + cx +d. You will need to consider different cases, 
depending on the values of the coefficients. 


21. Let n be an integer greater than or equal to two, and suppose f is a polynomial of degree 
n. How many inflection points can f have? Hint: Use the second derivative test and the 
fundamental theorem of algebra. 


5.5 ASYMPTOTES AND OTHER THINGS TO LOOK FOR 


A vertical asymptote is a place where the function becomes infinite, typically because 
the formula for the function has a denominator that becomes zero. For example, the 
reciprocal function f(x) = 1/ax has a vertical asymptote at x = 0, and the function tan x 
has a vertical asymptote at x = 7/2 (and also at x = —7/2, x = 37/2, etc.). Whenever the 
formula for a function contains a denominator it is worth looking for a vertical asymptote 
by checking to see if the denominator can ever be zero, and then checking the limit at such 
points. Note that there is not always a vertical asymptote where the denominator is zero: 
f(x) = (sinx)/ax has a zero denominator at x = 0, but since Tim (sin x)/x = 1 there is no 


asymptote there. 

A horizontal asymptote is a horizontal line to which f(x) gets closer and closer as x 
approaches oo (or as x approaches —oo). For example, the reciprocal function has the 
x-axis for a horizontal asymptote. Horizontal asymptotes can be identified by computing 
the limits Jim f(x) and Jim f(x). Since Jim i= jim 1/xz = 0, the line y = 0 (that 


is, the x-axis) is a horizontal asymptote in both directions. 
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Some functions have asymptotes that are neither horizontal nor vertical, but some 
other line. Such asymptotes are somewhat more difficult to identify and we will ignore 
them. 

If the domain of the function does not extend out to infinity, we should also ask 
what happens as x approaches the boundary of the domain. For example, the function 


y = f(x) = 1/V r? — a? has domain —r < x < r, and y becomes infinite as x approaches 
either r or —r. In this case we might also identify this behavior because when x = +r the 
denominator of the function is zero. 

If there are any points where the derivative fails to exist (a cusp or corner), then we 
should take special note of what the function does at such a point. 

Finally, it is worthwhile to notice any symmetry. A function f(z) that has the same 
value for —x as for x, ie., f(—x) = f(x), is called an “even function.” Its graph is 
symmetric with respect to the y-axis. Some examples of even functions are: 7” when n 
is an even number, cosz, and sin?z. On the other hand, a function that satisfies the 
property f(—x) = —f(z) is called an “odd function.” Its graph is symmetric with respect 
to the origin. Some examples of odd functions are: «” when n is an odd number, sin x, and 
tan z. Of course, most functions are neither even nor odd, and do not have any particular 


symmetry. 


Exercises 5.5. 


Sketch the curves. Identify clearly any interesting features, including local maximum and mini- 
mum points, inflection points, asymptotes, and intercepts. You can use this Sage worksheet to 
check your answers. Note that you may need to adjust the interval over which the function is 
graphed to capture all the details. 


1. you —5a*+52° 2. y= x? — 3a? —9r +5 
3. y=(x—1)?(«4+3)" A, a? ta’y? =a’y’,a>0. 
5. y= ce” 6. y=(e +e *)/2 

7. y=e  cosz 8. y=e’ —sinz 

9. y=e"/z 10. y=4¢4+Vl—« 

11. y=(e@4+1)/V5x? +35 12. y=a2—2 

13. y=6xr+sin3x 14. y=ax+41/z 

15. y=a2? 41/2 16. y=(x+5)'4 

17. y=tan’a« 18. y=cos’a—sin’ x 
19. y=sin’« 20. y=2(2? +1) 

21. y=a2° +62? +92 22. y=2/(x" —9) 

23. y=a2/(x? +9) 24. y=2/e-—2 


25. y= 3sin(zx) — sin®(a), for x € [0, 27] 26. y= (x —1)/(2”) 
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For each of the following five functions, identify any vertical and horizontal asymptotes, and 
identify intervals on which the function is concave up and increasing; concave up and decreasing; 
concave down and increasing; concave down and decreasing. 


27. 
28. 
29. 
30. 
31. 
32. 


(9) = sec(9) 

f(x) =1/A +2") 

f(x) = (@ — 3)/(2x — 2) 

f(a) =1/(1- 2?) 

f(w) =1+1/(2) 

Let f(z) = 1/(x* — a”), where a > 0. Find any vertical and horizontal asymptotes and 
the intervals upon which the given function is concave up and increasing; concave up and 


decreasing; concave down and increasing; concave down and decreasing. Discuss how the 
value of a affects these features. 


6 


Applications of the Derivative 


6.1 OPTIMIZATION 


Many important applied problems involve finding the best way to accomplish some task. 
Often this involves finding the maximum or minimum value of some function: the minimum 
time to make a certain journey, the minimum cost for doing a task, the maximum power 
that can be generated by a device, and so on. Many of these problems can be solved by 
finding the appropriate function and then using techniques of calculus to find the maximum 
or the minimum value required. 

Generally such a problem will have the following mathematical form: Find the largest 
(or smallest) value of f(a) when a < x < b. Sometimes a or 0 are infinite, but frequently 
the real world imposes some constraint on the values that « may have. 

Such a problem differs in two ways from the local maximum and minimum problems 
we encountered when graphing functions: We are interested only in the function between 
a and b, and we want to know the largest or smallest value that f(x) takes on, not merely 
values that are the largest or smallest in a small interval. That is, we seek not a local 
maximum or minimum but a global maximum or minimum, sometimes also called an 
absolute maximum or minimum. 

Any global maximum or minimum must of course be a local maximum or minimum. 
If we find all possible local extrema, then the global maximum, ?f it exists, must be the 
largest of the local maxima and the global minimum, 7f zt exists, must be the smallest of 
the local minima. We already know where local extrema can occur: only at those points 
at which f’(2) is zero or undefined. Actually, there are two additional points at which a 


maximum or minimum can occur if the endpoints a and 6 are not infinite, namely, at a 
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—2 1 
Figure 6.1.1. The function f(x) = a” restricted to [—2, 1] 


and b. We have not previously considered such points because we have not been interested 
in limiting a function to a small interval. An example should make this clear. 


EXAMPLE 6.1.1 Find the maximum and minimum values of f(x) = x? on the interval 
[—2, 1], shown in figure 6.1.1. We compute f’(a) = 2a, which is zero at x = 0 and is always 
defined. 

Since f’(1) = 2 we would not normally flag x = 1 as a point of interest, but it is clear 
from the graph that when f(a) is restricted to |—2,1] there is a local maximum at x = 1. 
Likewise we would not normally pay attention to x = —2, but since we have truncated f 
at —2 we have introduced a new local maximum there as well. In a technical sense nothing 
new is going on here: When we truncate f we actually create a new function, let’s call it 
g, that is defined only on the interval [—2,1]. If we try to compute the derivative of this 
new function we actually find that it does not have a derivative at —2 or 1. Why? Because 
to compute the derivative at 1 we must compute the limit 

; 1+ Az) — g(1 
Jn, 


This limit does not exist because when Ax > 0, g(1 + Az) is not defined. It is simpler, 
however, simply to remember that we must always check the endpoints. 

So the function g, that is, f restricted to [—2, 1], has one critical value and two finite 
endpoints, any of which might be the global maximum or minimum. We could first deter- 
mine which of these are local maximum or minimum points (or neither); then the largest 
local maximum must be the global maximum and the smallest local minimum must be the 
global minimum. It is usually easier, however, to compute the value of f at every point 
at which the global maximum or minimum might occur; the largest of these is the global 
maximum, the smallest is the global minimum. 

So we compute f(—2) = 4, f(0) =0, f(1) =1. The global maximum is 4 at « = —2 
and the global minimum is 0 at x = 0. 
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It is possible that there is no global maximum or minimum. It is difficult, and not 
particularly useful, to express a complete procedure for determining whether this is the 
case. Generally, the best approach is to gain enough understanding of the shape of the 
graph to decide. Fortunately, only a rough idea of the shape is usually needed. 

There are some particularly nice cases that are easy. A continuous function on a closed 
interval [a, 6] always has both a global maximum and a global minimum, so examining the 
critical values and the endpoints is enough: 


THEOREM 6.1.2 Extreme value theorem _ If f is continuous on a closed interval 
(a, bj, then it has both a minimum and a maximum point. That is, there are real numbers 
c and d in |a, 6] so that for every x in [a,b], f(x) < f(c) and f(x) > f(d). | 


Another easy case: If a function is continuous and has a single critical value, then if 
there is a local maximum at the critical value it is a global maximum, and if it is a local 
minimum it is a global minimum. There may also be a global minimum in the first case, 
or a global maximum in the second case, but that will generally require more effort to 


determine. 


EXAMPLE 6.1.3 Let f(x) = —2? + 4x —3. Find the maximum value of f(a) on the 
interval (0, 4]. First note that f’(~) = —2x+4 =0 when x = 2, and f(2) = 1. Next observe 
that f’(x) is defined for all x, so there are no other critical values. Finally, f(0) = —3 and 
f(4) = —3. The largest value of f(x) on the interval [0,4] is f(2) = 1. 


EXAMPLE 6.1.4 Let f(x) = —x* + 42 — 3. Find the maximum value of f(z) on the 
interval [—1, 1]. 

First note that f’(x) = —2a+4=0 when x = 2. But x = 2 is not in the interval, so 
we don’t use it. Thus the only two points to be checked are the endpoints; f(—1) = —8 
and f(1) =0. So the largest value of f(z) on [—1,1] is f(1) =0. 


EXAMPLE 6.1.5 Find the maximum and minimum values of the function f(x) = 
7+ |x —2| for « between 1 and 4 inclusive. The derivative f’(x) is never zero, but f’(x) is 
undefined at 2 = 2, so we compute f(2) = 7. Checking the end points we get f(1) = 8 and 
f(4) = 9. The smallest of these numbers is f(2) = 7, which is, therefore, the minimum 


value of f(z) on the interval 1 < x < 4, and the maximum is f(4) = 9. 


EXAMPLE 6.1.6 Find all local maxima and minima for f(x) = «° — x, and deter- 
mine whether there is a global maximum or minimum on the open interval (—2,2). In 
example 5.1.2 we found a local maximum at (—V3/3,2V3/9) and a local minimum at 
(3/3, -2V3/9). Since the endpoints are not in the interval (—2,2) they cannot be con- 
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Figure 6.1.2 f(#)=2°—2 


sidered. Is the lone local maximum a global maximum? Here we must look more closely at 
the graph. We know that on the closed interval [—V3/3, V3/3] there is a global maximum 
at c= —V/3/ 3 and a global minimum at x = V3/ 3. So the question becomes: what hap- 
pens between —2 and —V3/3, and between 3/3 and 2? Since there is a local minimum 
at 2 = V3/3, the graph must continue up to the right, since there are no more critical 
values. This means no value of f will be less than —2V3/9 between V3/3 and 2, but it 
says nothing about whether we might find a value larger than the local maximum 2V3 /9. 
How can we tell? Since the function increases to the right of V3 /3, we need to know what 
the function values do “close to” 2. Here the easiest test is to pick a number and do a 
computation to get some idea of what’s going on. Since f(1.9) = 4.959 > 23/9, there 
is no global maximum at —V3/3, and hence no global maximum at all. (How can we tell 
that 4.959 > 2V3/ 9? We can use a calculator to approximate the right hand side; if it is 
not even close to 4.959 we can take this as decisive. Since 2V3/9 = 0.3849, there’s really 
no question. Funny things can happen in the rounding done by computers and calculators, 
however, so we might be a little more careful, especially if the values come out quite close. 
In this case we can convert the relation 4.959 > 2V3/9 into (9/2)4.959 > V3 and ask 
whether this is true. Since the left side is clearly larger than 4-4 which is clearly larger 
than V3, this settles the question.) 

A similar analysis shows that there is also no global minimum. The graph of f(x) on 
(—2,2) is shown in figure 6.1.2. 


EXAMPLE 6.1.7 Of all rectangles of area 100, which has the smallest perimeter? 
First we must translate this into a purely mathematical problem in which we want to 

find the minimum value of a function. If 2 denotes one of the sides of the rectangle, then 

the adjacent side must be 100/ (in order that the area be 100). So the function we want 
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to minimize is 
f(x) = 2a + pees 
x 
since the perimeter is twice the length plus twice the width of the rectangle. Not all values 
of x make sense in this problem: lengths of sides of rectangles must be positive, so x > 0. 
If x > 0 then so is 100/z, so we need no second condition on x. 

We next find f’(x) and set it equal to zero: 0 = f’(x) = 2—200/x?. Solving f’(a) = 0 
for x gives us x = +10. We are interested only in x > 0, so only the value x = 10 is of 
interest. Since f’() is defined everywhere on the interval (0, co), there are no more critical 
values, and there are no endpoints. Is there a local maximum, minimum, or neither at 
x = 10? The second derivative is f’(2) = 400/x°, and f”(10) > 0, so there is a local 


minimum. Since there is only one critical value, this is also the global minimum, so the 


rectangle with smallest perimeter is the 10 x 10 square. 


EXAMPLE 6.1.8 You want to sell a certain number n of items in order to maximize 
your profit. Market research tells you that if you set the price at $1.50, you will be able 
to sell 5000 items, and for every 10 cents you lower the price below $1.50 you will be able 
to sell another 1000 items. Suppose that your fixed costs (“start-up costs”) total $2000, 
and the per item cost of production (“marginal cost”) is $0.50. Find the price to set per 
item and the number of items sold in order to maximize profit, and also determine the 
maximum profit you can get. 

The first step is to convert the problem into a function maximization problem. Since 
we want to maximize profit by setting the price per item, we should look for a function 
P(x) representing the profit when the price per item is x. Profit is revenue minus costs, and 
revenue is number of items sold times the price per item, so we get P = nx — 2000 —0.50n. 
The number of items sold is itself a function of x, n = 5000 + 1000(1.5 — x)/0.10, because 
(1.5 — z)/0.10 is the number of multiples of 10 cents that the price is below $1.50. Now 
we substitute for n in the profit function: 


P(x) = (5000 + 1000(1.5 — x)/0.10)a — 2000 — 0.5(5000 + 1000(1.5 — x) /0.10) 
= —10000x? + 25000x — 12000 


We want to know the maximum value of this function when x is between 0 and 1.5. The 
derivative is P’(x) = —20000xz + 25000, which is zero when x = 1.25. Since P”(x) = 
—20000 < 0, there must be a local maximum at x = 1.25, and since this is the only critical 
value it must be a global maximum as well. (Alternately, we could compute P(0) = 
—12000, P(1.25) = 3625, and P(1.5) = 3000 and note that P(1.25) is the maximum of 
these.) Thus the maximum profit is $3625, attained when we set the price at $1.25 and 
sell 7500 items. 
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Figure 6.1.3 Rectangle in a parabola. 


EXAMPLE 6.1.9 Find the largest rectangle (that is, the rectangle with largest area) 
that fits inside the graph of the parabola y = x” below the line y = a (a is an unspecified 
constant value), with the top side of the rectangle on the horizontal line y = a; see 
figure 6.1.3.) 

We want to find the maximum value of some function A(x) representing area. Perhaps 
the hardest part of this problem is deciding what x should represent. The lower right 
corner of the rectangle is at (2,27), and once this is chosen the rectangle is completely 
determined. So we can let the x in A(x) be the x of the parabola f(x) = «?. Then the 
area is A(x) = (2x)(a—a”) = —22°+2aa2. We want the maximum value of A(x) when z is 
in (0, /a]. (You might object to allowing x = 0 or x = V/a, since then the “rectangle” has 
either no width or no height, so is not “really” a rectangle. But the problem is somewhat 
easier if we simply allow such rectangles, which have zero area.) 

Setting 0 = A’(x) = —6x? + 2a we get x = Ja/3 as the only critical value. Testing 
this and the two endpoints, we have A(0) = A(V/a) = 0 and A(\/a/3) = (4/9)V3a°/?. 


The maximum area thus occurs when the rectangle has dimensions 2,/a/3 x (2/3)a. 


EXAMPLE 6.1.10 If you fit the largest possible cone inside a sphere, what fraction 
of the volume of the sphere is occupied by the cone? (Here by “cone” we mean a right 
circular cone, i.e., a cone for which the base is perpendicular to the axis of symmetry, and 
for which the cross-section cut perpendicular to the axis of symmetry at any point is a 
circle.) 

Let R be the radius of the sphere, and let r and h be the base radius and height of 
the cone inside the sphere. What we want to maximize is the volume of the cone: 1r?h/3. 
Here R is a fixed value, but r and h can vary. Namely, we could choose r to be as large as 
possible—equal to R—by taking the height equal to R; or we could make the cone’s height 
h larger at the expense of making r a little less than R. See the cross-section depicted in 
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(h = R, r) 


Figure 6.1.4 Cone in a sphere. 


figure 6.1.4. We have situated the picture in a convenient way relative to the x and y axes, 
namely, with the center of the sphere at the origin and the vertex of the cone at the far 
left on the x-axis. 

Notice that the function we want to maximize, mr7h /3, depends on two variables. This 
is frequently the case, but often the two variables are related in some way so that “really” 
there is only one variable. So our next step is to find the relationship and use it to solve 
for one of the variables in terms of the other, so as to have a function of only one variable 
to maximize. In this problem, the condition is apparent in the figure: the upper corner of 
the triangle, whose coordinates are (hk — R,r), must be on the circle of radius R. That is, 


(h— RB)? +7? = R?. 


We can solve for h in terms of r or for r in terms of h. Either involves taking a square 
root, but we notice that the volume function contains r?, not r by itself, so it is easiest to 
solve for r? directly: r? = R? — (h — R)*. Then we substitute the result into mr7h/3: 


V(h) = 1(R? — (h— R)*)h/3 


1 2 
=-—h? + —ch?R 
a ge 

We want to maximize V(h) when h is between 0 and 2R. Now we solve 0 = f’(h) = 
—th? + (4/3)chR, getting h = 0 or h = 4R/3. We compute V(0) = V(2R) = 0 and 
V(4R/3) = (32/81)7R°. The maximum is the latter; since the volume of the sphere is 
(4/3)7R?®, the fraction of the sphere occupied by the cone is 


(32/81)7R® 8 


= 30%. 
(4/3)7R3 2h 7 
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EXAMPLE 6.1.11 You are making cylindrical containers to contain a given volume. 
Suppose that the top and bottom are made of a material that is N times as expensive 
(cost per unit area) as the material used for the lateral side of the cylinder. Find (in terms 
of N) the ratio of height to base radius of the cylinder that minimizes the cost of making 
the containers. 

Let us first choose letters to represent various things: h for the height, r for the base 
radius, V for the volume of the cylinder, and c for the cost per unit area of the lateral side 
of the cylinder; V and c¢ are constants, h and r are variables. Now we can write the cost 
of materials: 

c(2rrh) + Ne(2rr?). 


Again we have two variables; the relationship is provided by the fixed volume of the 
cylinder: V = rr7h. We use this relationship to eliminate h (we could eliminate r, but it’s 
a little easier if we eliminate h, which appears in only one place in the above formula for 
cost). The result is 


V 2cV 
f(r) = 2enr—, + 2Nemr? = + 2Nenr?. 
Tr i 


We want to know the minimum value of this function when r is in (0,00). We now set 
0 = f'(r) = -2cV/r? + 4Nenr, giving r = */V/(2N7r). Since f’(r) = 4cV/r? + 4Nen 
is positive when r is positive, there is a local minimum at the critical value, and hence a 
global minimum since there is only one critical value. 

Finally, since h = V/(rr?), 


h 4 V 


rare (V/(2Nn)) a 


y] 


so the minimum cost occurs when the height h is 2N times the radius. If, for example, 
there is no difference in the cost of materials, the height is twice the radius (or the height 


is equal to the diameter). 


D B C 


Figure 6.1.5 Minimizing travel time. 
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EXAMPLE 6.1.12 Suppose you want to reach a point A that is located across the 
sand from a nearby road (see figure 6.1.5). Suppose that the road is straight, and b is the 
distance from A to the closest point C on the road. Let v be your speed on the road, and 
let w, which is less than v, be your speed on the sand. Right now you are at the point 
D, which is a distance a from C. At what point B should you turn off the road and head 
across the sand in order to minimize your travel time to A? 

Let x be the distance short of C where you turn off, i.e., the distance from B to C’. We 
want to minimize the total travel time. Recall that when traveling at constant velocity, 
time is distance divided by velocity. 

You travel the distance DB at speed v, and then the distance BA at speed w. Since 
DB = a~—v<2 and, by the Pythagorean theorem, BA = \/x? + b?, the total time for the 
trip is 


aS Aa 
Vv w ; 


f(x) = 


We want to find the minimum value of f when x is between 0 and a. As usual we set 
f'(x) = 0 and solve for z: 


=e 
Notice that a does not appear in the last expression, but a is not irrelevant, since we 


are interested only in critical values that are in [0,a], and wb/\/v? — w? is either in this 
interval or not. If it is, we can use the second derivative to test it: 


b2 


f"'(a) = Capen 


Since this is always positive there is a local minimum at the critical point, and so it is a 
global minimum as well. 
If the critical value is not in [0,a] it is larger than a. In this case the minimum must 
occur at one of the endpoints. We can compute 
ab 
f= 2+ 


U W 
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but it is difficult to determine which of these is smaller by direct comparison. If, as is 


likely in practice, we know the values of v, w, a, and 6, then it is easy to determine this. 


With a little cleverness, however, we can determine the minimum in general. We have seen 


that f’(x) is always positive, so the derivative f’(x) is always increasing. We know that 


at wb/\/v? — w? the derivative is zero, so for values of x less than that critical value, the 
derivative is negative. This means that f(0) > f(a), so the minimum occurs when z = a. 


So the upshot is this: If you start farther away from C' than wb/./v? — w? then you 


always want to cut across the sand when you are a distance wb/\/v? — w? from point C. 


If you start closer than this to C, you should cut directly across the sand. 


Summary—Steps to solve an optimization problem. 


1. Decide what the variables are and what the constants are, draw a diagram if 
appropriate, understand clearly what it is that is to be maximized or minimized. 


2. Write a formula for the function for which you wish to find the maximum or 


minimum. 
3. Express that formula in terms of only one variable, that is, in the form f(z). 


4. Set f’(x) = 0 and solve. Check all critical values and endpoints to determine the 


extreme value. 


Exercises 6.1. 


1. 


1+4¢—2? forx <3 
Let a = 
ae) eee forz >3 
Find the maximum value and minimum values of f(x) for x in [0,4]. Graph f(a) to check 
your answers. => 


2. Find the dimensions of the rectangle of largest area having fixed perimeter 100. > 


. Find the dimensions of the rectangle of largest area having fixed perimeter P. > 


. A box with square base and no top is to hold a volume 100. Find the dimensions of the box 


that requires the least material for the five sides. Also find the ratio of height to side of the 
base. => 


. A box with square base is to hold a volume 200. The bottom and top are formed by folding 


in flaps from all four sides, so that the bottom and top consist of two layers of cardboard. 
Find the dimensions of the box that requires the least material. Also find the ratio of height 
to side of the base. => 


. A box with square base and no top is to hold a volume V. Find (in terms of V) the dimensions 


of the box that requires the least material for the five sides. Also find the ratio of height to 
side of the base. (This ratio will not involve V.) > 


. You have 100 feet of fence to make a rectangular play area alongside the wall of your house. 


The wall of the house bounds one side. What is the largest size possible (in square feet) for 
the play area? > 


10. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 
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. You have | feet of fence to make a rectangular play area alongside the wall of your house. 


The wall of the house bounds one side. What is the largest size possible (in square feet) for 
the play area? > 


. Marketing tells you that if you set the price of an item at $10 then you will be unable to sell 


it, but that you can sell 500 items for each dollar below $10 that you set the price. Suppose 
your fixed costs total $3000, and your marginal cost is $2 per item. What is the most profit 
you can make? => 


Find the area of the largest rectangle that fits inside a semicircle of radius 10 (one side of 
the rectangle is along the diameter of the semicircle). > 


Find the area of the largest rectangle that fits inside a semicircle of radius r (one side of the 
rectangle is along the diameter of the semicircle). => 


For a cylinder with surface area 50, including the top and the bottom, find the ratio of height 
to base radius that maximizes the volume. => 


For a cylinder with given surface area S, including the top and the bottom, find the ratio of 
height to base radius that maximizes the volume. > 


You want to make cylindrical containers to hold 1 liter (1000 cubic centimeters) using the 
least amount of construction material. The side is made from a rectangular piece of material, 
and this can be done with no material wasted. However, the top and bottom are cut from 
squares of side 2r, so that 2(2r)? = 8r of material is needed (rather than 27r?, which is 
the total area of the top and bottom). Find the dimensions of the container using the least 
amount of material, and also find the ratio of height to radius for this container. > 


You want to make cylindrical containers of a given volume V using the least amount of 
construction material. The side is made from a rectangular piece of material, and this can 
be done with no material wasted. However, the top and bottom are cut from squares of side 
2r, so that 2(2r)” = 8r° of material is needed (rather than 2nr*, which is the total area of 
the top and bottom). Find the optimal ratio of height to radius. > 


Given a right circular cone, you put an upside-down cone inside it so that its vertex is at the 
center of the base of the larger cone and its base is parallel to the base of the larger cone. If 
you choose the upside-down cone to have the largest possible volume, what fraction of the 
volume of the larger cone does it occupy? (Let H and R be the height and base radius of 
the larger cone, and let h and r be the height and base radius of the smaller cone. Hint: Use 
similar triangles to get an equation relating h and r.) => 


In example 6.1.12, what happens if w > v (i.e., your speed on sand is at least your speed on 
the road)? > 


A container holding a fixed volume is being made in the shape of a cylinder with a hemi- 
spherical top. (The hemispherical top has the same radius as the cylinder.) Find the ratio of 
height to radius of the cylinder which minimizes the cost of the container if (a) the cost per 
unit area of the top is twice as great as the cost per unit area of the side, and the container 
is made with no bottom; (b) the same as in (a), except that the container is made with a 
circular bottom, for which the cost per unit area is 1.5 times the cost per unit area of the 
side. > 


A piece of cardboard is 1 meter by 1/2 meter. A square is to be cut from each corner and 
the sides folded up to make an open-top box. What are the dimensions of the box with 
maximum possible volume? => 
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20. 


21. 


22. 


23. 


24. 


25. 


26. 


27. 


28. 


29. 


(a) A square piece of cardboard of side a is used to make an open-top box by cutting out 
a small square from each corner and bending up the sides. How large a square should be 
cut from each corner in order that the box have maximum volume? (b) What if the piece of 
cardboard used to make the box is a rectangle of sides a and b? => 


A window consists of a rectangular piece of clear glass with a semicircular piece of colored 
glass on top; the colored glass transmits only 1/2 as much light per unit area as the the clear 
glass. If the distance from top to bottom (across both the rectangle and the semicircle) is 
2 meters and the window may be no more than 1.5 meters wide, find the dimensions of the 
rectangular portion of the window that lets through the most light. > 


A window consists of a rectangular piece of clear glass with a semicircular piece of colored 
glass on top. Suppose that the colored glass transmits only k times as much light per unit 
area as the clear glass (k is between 0 and 1). If the distance from top to bottom (across 
both the rectangle and the semicircle) is a fixed distance H, find (in terms of k) the ratio of 
vertical side to horizontal side of the rectangle for which the window lets through the most 
light. => 


You are designing a poster to contain a fixed amount A of printing (measured in square 
centimeters) and have margins of a centimeters at the top and bottom and b centimeters at 
the sides. Find the ratio of vertical dimension to horizontal dimension of the printed area on 
the poster if you want to minimize the amount of posterboard needed. => 


The strength of a rectangular beam is proportional to the product of its width w times the 
square of its depth d. Find the dimensions of the strongest beam that can be cut from a 
cylindrical log of radius r. > 


— 


Figure 6.1.6 Cutting a beam. 


What fraction of the volume of a sphere is taken up by the largest cylinder that can be fit 
inside the sphere? => 

The U.S. post office will accept a box for shipment only if the sum of the length and girth 
(distance around) is at most 108 in. Find the dimensions of the largest acceptable box with 
square front and back. => 

Find the dimensions of the lightest cylindrical can containing 0.25 liter (=250 cm®) if the 
top and bottom are made of a material that is twice as heavy (per unit area) as the material 
used for the side. > 

A conical paper cup is to hold 1/4 of a liter. Find the height and radius of the cone which 
minimizes the amount of paper needed to make the cup. Use the formula mrv/r? + h? for 
the area of the side of a cone. > 

A conical paper cup is to hold a fixed volume of water. Find the ratio of height to base radius 
of the cone which minimizes the amount of paper needed to make the cup. Use the formula 
mrv/r2 + h? for the area of the side of a cone, called the lateral area of the cone. => 


30. 


31. 


32. 


33. 


34. 
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If you fit the cone with the largest possible surface area (lateral area plus area of base) into 
a sphere, what percent of the volume of the sphere is occupied by the cone? => 


Two electrical charges, one a positive charge A of magnitude a and the other a negative 
charge B of magnitude b, are located a distance c apart. A positively charged particle P is 
situated on the line between A and B. Find where P should be put so that the pull away 
from A towards B is minimal. Here assume that the force from each charge is proportional 
to the strength of the source and inversely proportional to the square of the distance from 
the source. => 


Find the fraction of the area of a triangle that is occupied by the largest rectangle that can 
be drawn in the triangle (with one of its sides along a side of the triangle). Show that this 
fraction does not depend on the dimensions of the given triangle. > 


How are your answers to Problem 9 affected if the cost per item for the x items, instead 
of being simply $2, decreases below $2 in proportion to x (because of economy of scale and 
volume discounts) by 1 cent for each 25 items produced? => 


You are standing near the side of a large wading pool of uniform depth when you see a child 
in trouble. You can run at a speed vi on land and at a slower speed v2 in the water. Your 
perpendicular distance from the side of the pool is a, the child’s perpendicular distance is b, 
and the distance along the side of the pool between the closest point to you and the closest 
point to the child is c (see the figure below). Without stopping to do any calculus, you 
instinctively choose the quickest route (shown in the figure) and save the child. Our purpose 
is to derive a relation between the angle 6; your path makes with the perpendicular to the side 
of the pool when you’re on land, and the angle 62 your path makes with the perpendicular 
when you’re in the water. To do this, let x be the distance between the closest point to you 
at the side of the pool and the point where you enter the water. Write the total time you 
run (on land and in the water) in terms of x (and also the constants a, b,c, v1, v2). Then set 
the derivative equal to zero. The result, called “Snell’s law” or the “law of refraction,” also 
governs the bending of light when it goes into water. > 


Figure 6.1.7 Wading pool rescue. 


6.2 RELATED RATES 


Suppose we have two variables x and y (in most problems the letters will be different, 


but for now let’s use x and y) which are both changing with time. A “related rates” 


problem is a problem in which we know one of the rates of change at a given instant—-say, 
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& = dx/dt—and we want to find the other rate y = dy/dt at that instant. (The use of 
& to mean dx/dt goes back to Newton and is still used for this purpose, especially by 
physicists. ) 

If y is written in terms of z, i.e., y = f(x), then this is easy to do using the chain rule: 


dy dy dx dy, 


a ae ae ee 


That is, find the derivative of f(x), plug in the value of x at the instant in question, and 
multiply by the given value of « = dx/dt to get y = dy/dt. 


EXAMPLE 6.2.1 Suppose an object is moving along a path described by y = x”, that 
is, it is moving on a parabolic path. At a particular time, say t = 5, the x coordinate is 
6 and we measure the speed at which the x coordinate of the object is changing and find 
that dx/dt = 3. At the same time, how fast is the y coordinate changing? 

Using the chain rule, dy/dt = 2x-dax/dt. At t = 5 we know that x = 6 and dx/dt = 3, 
so dy/dt = 2-6-3 = 36. 


In many cases, particularly interesting ones, x and y will be related in some other way, 
for example x = f(y), or F(z,y) = k, or perhaps F(z,y) = G(z,y), where F(z,y) and 
G(x,y) are expressions involving both variables. In all cases, you can solve the related 
rates problem by taking the derivative of both sides, plugging in all the known values 
(namely, x, y, and «), and then solving for 4. 

To summarize, here are the steps in doing a related rates problem: 


1. Decide what the two variables are. 
. Find an equation relating them. 
. Take d/dt of both sides. 


. Plug in all known values at the instant in question. 


a -. WwW WD 


. Solve for the unknown rate. 


EXAMPLE 6.2.2. A plane is flying directly away from you at 500 mph at an altitude 
of 3 miles. How fast is the plane’s distance from you increasing at the moment when the 
plane is flying over a point on the ground 4 miles from you? 

To see what’s going on, we first draw a schematic representation of the situation, as 
in figure 6.2.1. 

Because the plane is in level flight directly away from you, the rate at which x changes 
is the speed of the plane, dx/dt = 500. The distance between you and the plane is y; it 
is dy/dt that we wish to know. By the Pythagorean Theorem we know that x? +9 = y’. 
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Figure 6.2.1 Receding airplane. 


Taking the derivative: 
2 = fay. 


We are interested in the time at which x = 4; at this time we know that 4° +9 = y?, so 
y = 5. Putting together all the information we get 


2(4)(500) = 2(5)yp. 


Thus, y = 400 mph. 


EXAMPLE 6.2.3 You are inflating a spherical balloon at the rate of 7 cm?/sec. How 
fast is its radius increasing when the radius is 4 cm? 

Here the variables are the radius r and the volume V. We know dV/dt, and we want 
dr/dt. The two variables are related by means of the equation V = 4rr°/3. Taking the 
derivative of both sides gives dV/dt = 4rr?7. We now substitute the values we know at 


the instant in question: 7 = 47477, so ¢ = 7/(647) cm/sec. 


EXAMPLE 6.2.4 Water is poured into a conical container at the rate of 10 cm?/sec. 
The cone points directly down, and it has a height of 30 cm and a base radius of 10 cm; see 
figure 6.2.2. How fast is the water level rising when the water is 4 cm deep (at its deepest 
point)? 

The water forms a conical shape within the big cone; its height and base radius and 
volume are all increasing as water is poured into the container. This means that we actually 
have three things varying with time: the water level h (the height of the cone of water), 
the radius r of the circular top surface of water (the base radius of the cone of water), and 
the volume of water V. The volume of a cone is given by V = mr7h/3. We know dV/dt, 
and we want dh/dt. At first something seems to be wrong: we have a third variable r 
whose rate we don’t know. 

But the dimensions of the cone of water must have the same proportions as those of 
the container. That is, because of similar triangles, r/h = 10/30 so r = h/3. Now we can 
eliminate r from the problem entirely: V = 1(h/3)?h/3 = th?/27. We take the derivative 
of both sides and plug in h = 4 and dV/dt = 10, obtaining 10 = (37 -4?/27)(dh/dt). Thus, 
dh/dt = 90/(167) cm/sec. 
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Figure 6.2.2 Conical water tank. 


EXAMPLE 6.2.5 A swing consists of a board at the end of a 10 ft long rope. Think 
of the board as a point P at the end of the rope, and let Q be the point of attachment 
at the other end. Suppose that the swing is directly below Q at time t = 0, and is being 
pushed by someone who walks at 6 ft/sec from left to right. Find (a) how fast the swing 
is rising after 1 sec; (b) the angular speed of the rope in deg/sec after 1 sec. 

We start out by asking: What is the geometric quantity whose rate of change we know, 
and what is the geometric quantity whose rate of change we’re being asked about? Note 
that the person pushing the swing is moving horizontally at a rate we know. In other 
words, the horizontal coordinate of P is increasing at 6 ft/sec. In the xy-plane let us 
make the convenient choice of putting the origin at the location of P at time t = 0, i.e., a 
distance 10 directly below the point of attachment. Then the rate we know is dx/dt, and 
in part (a) the rate we want is dy/dt (the rate at which P is rising). In part (b) the rate 
we want is 6 = d@ /dt, where @ stands for the angle in radians through which the swing has 
swung from the vertical. (Actually, since we want our answer in deg/sec, at the end we 
must convert d0/dt from rad/sec by multiplying by 180/z.) 

(a) From the diagram we see that we have a right triangle whose legs are x and 10 — y, 
and whose hypotenuse is 10. Hence x” + (10 — y)? = 100. Taking the derivative of both 
sides we obtain: 2x4 + 2(10 — y)(0—y) = 0. We now look at what we know after 1 second, 
namely x = 6 (because x started at 0 and has been increasing at the rate of 6 ft/sec 
for 1 sec), y = 2 (because we get 10 — y = 8 from the Pythagorean theorem applied to 
the triangle with hypotenuse 10 and leg 6), and « = 6. Putting in these values gives us 
2-6-6—2-8y =0, from which we can easily solve for y: y = 4.5 ft/sec. 

(b) Here our two variables are x and 6, so we want to use the same right triangle as 
in part (a), but this time relate 0 to x. Since the hypotenuse is constant (equal to 10), 
the best way to do this is to use the sine: sin? = 2/10. Taking derivatives we obtain 
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Figure 6.2.3 Swing. 


(cos 0)0 = 0.1%. At the instant in question (t = 1 sec), when we have a right triangle with 
sides 6-8-10, cos? = 8/10 and « = 6. Thus (8/10)6 = 6/10, i.e., 6 = 6/8 = 3/4 rad/sec, 
or approximately 43 deg/sec. 


We have seen that sometimes there are apparently more than two variables that change 
with time, but in reality there are just two, as the others can be expressed in terms of 
just two. But sometimes there really are several variables that change with time; as long 
as you know the rates of change of all but one of them you can find the rate of change of 
the remaining one. As in the case when there are just two variables, take the derivative 
of both sides of the equation relating all of the variables, and then substitute all of the 


known values and solve for the unknown rate. 


EXAMPLE 6.2.6 A road running north to south crosses a road going east to west at 
the point P. Car A is driving north along the first road, and car B is driving east along the 
second road. At a particular time car A is 10 kilometers to the north of P and traveling at 
80 km/hr, while car B is 15 kilometers to the east of P and traveling at 100 km/hr. How 
fast is the distance between the two cars changing? 


(0, ah 


- (b(t), 0) 


Figure 6.2.4 Cars moving apart. 
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Let a(t) be the distance of car A north of P at time t, and b(t) the distance of car B east 


of P at time t, and let c(t) be the distance from car A to car B at time t. By the Pythagorean 
Theorem, c(t)? = a(t)*+b(t)?. Taking derivatives we get 2c(t)c'(t) = 2a(t)a’(t) +2b(t)b'(t), 


sO 


os aa + bb _ aa + bb 


Substituting known values we get: 


at the time of interest. 


.  10-804+15-100 460 
Vv 102 + 152 V13 


= 127.6km/hr 


Notice how this problem differs from example 6.2.2. In both cases we started with the 


Pythagorean Theorem and took derivatives on both sides. However, in example 6.2.2 one 


of the sides was a constant (the altitude of the plane), and so the derivative of the square 


of that side of the triangle was simply zero. In this example, on the other hand, all three 


sides of the right triangle are variables, even though we are interested in a specific value 


of each side of the triangle (namely, when the sides have lengths 10 and 15). Make sure 


that you understand at the start of the problem what are the variables and what are the 


constants. 


Exercises 6.2. 


1. A cylindrical tank standing upright (with one circular base on the ground) has radius 20 


cm. How fast does the water level in the tank drop when the water is being drained at 25 
cm?/sec? > 


. A cylindrical tank standing upright (with one circular base on the ground) has radius 1 


meter. How fast does the water level in the tank drop when the water is being drained at 3 
liters per second? => 


. A ladder 13 meters long rests on horizontal ground and leans against a vertical wall. The 


foot of the ladder is pulled away from the wall at the rate of 0.6 m/sec. How fast is the top 
sliding down the wall when the foot of the ladder is 5 m from the wall? > 


. A ladder 13 meters long rests on horizontal ground and leans against a vertical wall. The 


top of the ladder is being pulled up the wall at 0.1 meters per second. How fast is the foot 
of the ladder approaching the wall when the foot of the ladder is 5 m from the wall? => 


. A rotating beacon is located 2 miles out in the water. Let A be the point on the shore that 


is closest to the beacon. As the beacon rotates at 10 rev/min, the beam of light sweeps down 
the shore once each time it revolves. Assume that the shore is straight. How fast is the point 
where the beam hits the shore moving at an instant when the beam is lighting up a point 2 
miles along the shore from the point A? > 


. A baseball diamond is a square 90 ft on a side. A player runs from first base to second base 


at 15 ft/sec. At what rate is the player’s distance from third base decreasing when she is 
half way from first to second base? = 


10. 


11. 


12. 


13. 
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. Sand is poured onto a surface at 15 cm®/sec, forming a conical pile whose base diameter is 


always equal to its altitude. How fast is the altitude of the pile increasing when the pile is 3 
cm high? > 


. A boat is pulled in to a dock by a rope with one end attached to the front of the boat and 


the other end passing through a ring attached to the dock at a point 5 ft higher than the 
front of the boat. The rope is being pulled through the ring at the rate of 0.6 ft/sec. How 
fast is the boat approaching the dock when 13 ft of rope are out? => 


. A balloon is at a height of 50 meters, and is rising at the constant rate of 5 m/sec. A bicyclist 


passes beneath it, traveling in a straight line at the constant speed of 10 m/sec. How fast is 
the distance between the bicyclist and the balloon increasing 2 seconds later? => 


A pyramid-shaped vat has square cross-section and stands on its tip. The dimensions at the 
top are 2m Xx 2m, and the depth is 5 m. If water is flowing into the vat at 3 m?/min, how 
fast is the water level rising when the depth of water (at the deepest point) is 4 m? Note: 
the volume of any “conical” shape (including pyramids) is (1/3)(height) (area of base). > 


The sun is rising at the rate of 1/4 deg/min, and appears to be climbing into the sky 
perpendicular to the horizon, as depicted in figure 6.2.5. How fast is the shadow of a 200 
meter building shrinking at the moment when the shadow is 500 meters long? => 


The sun is setting at the rate of 1/4 deg/min, and appears to be dropping perpendicular to 
the horizon, as depicted in figure 6.2.5. How fast is the shadow of a 25 meter wall lengthening 
at the moment when the shadow is 50 meters long? => 


Figure 6.2.5 Sunrise or sunset. 


The trough shown in figure 6.2.6 is constructed by fastening together three slabs of wood of 
dimensions 10 ft x 1 ft, and then attaching the construction to a wooden wall at each end. 
The angle @ was originally 30°, but because of poor construction the sides are collapsing. 
The trough is full of water. At what rate (in ft®/sec) is the water spilling out over the top 
of the trough if the sides have each fallen to an angle of 45°, and are collapsing at the rate 
of 1° per second? => 


Figure 6.2.6 ‘Trough. 
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14. 


15. 


16. 


17. 


18. 


19. 


20. 


21. 


22. 


23. 


24. 


A woman 5 ft tall walks at the rate of 3.5 ft/sec away from a streetlight that is 12 ft above 
the ground. At what rate is the tip of her shadow moving? At what rate is her shadow 
lengthening? => 


A man 1.8 meters tall walks at the rate of 1 meter per second toward a streetlight that is 4 
meters above the ground. At what rate is the tip of his shadow moving? At what rate is his 
shadow shortening? => 


A police helicopter is flying at 150 mph at a constant altitude of 0.5 mile above a straight 
road. The pilot uses radar to determine that an oncoming car is at a distance of exactly 1 
mile from the helicopter, and that this distance is decreasing at 190 mph. Find the speed of 
the car. > 


A police helicopter is flying at 200 kilometers per hour at a constant altitude of 1 km above 
a straight road. The pilot uses radar to determine that an oncoming car is at a distance 
of exactly 2 kilometers from the helicopter, and that this distance is decreasing at 250 kph. 
Find the speed of the car. > 


A light shines from the top of a pole 20 m high. A ball is falling 10 meters from the pole, 
casting a shadow on a building 30 meters away, as shown in figure 6.2.7. When the ball is 25 
meters from the ground it is falling at 6 meters per second. How fast is its shadow moving? 
=> 


Figure 6.2.7 ‘Falling ball. 


Do example 6.2.6 assuming that the angle between the two roads is 120° instead of 90° (that 
is, the “north-south” road actually goes in a somewhat northwesterly direction from P). 
Recall the law of cosines: c? = a” + b” — 2abcos 6. => 


Do example 6.2.6 assuming that car A is 300 meters north of P, car B is 400 meters east 
of P, both cars are going at constant speed toward P, and the two cars will collide in 10 
seconds. => 


Do example 6.2.6 assuming that 8 seconds ago car A started from rest at P and has been 
picking up speed at the steady rate of 5 m/sec”, and 6 seconds after car A started car B 
passed P moving east at constant speed 60 m/sec. => 

Referring again to example 6.2.6, suppose that instead of car B an airplane is flying at speed 
200 km/hr to the east of P at an altitude of 2 km, as depicted in figure 6.2.8. How fast is 
the distance between car and airplane changing? > 

Referring again to example 6.2.6, suppose that instead of car B an airplane is flying at speed 
200 km/hr to the east of P at an altitude of 2 km, and that it is gaining altitude at 10 km/hr. 
How fast is the distance between car and airplane changing? => 

A light shines from the top of a pole 20 m high. An object is dropped from the same height 
from a point 10 m away, so that its height at time t seconds is h(t) = 20 — 9.8¢7/2. How fast 
is the object’s shadow moving on the ground one second later? = 
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Figure 6.2.8 Car and airplane. 


25. The two blades of a pair of scissors are fastened at the point A as shown in figure 6.2.9. Let 
a denote the distance from A to the tip of the blade (the point B). Let 6 denote the angle 
at the tip of the blade that is formed by the line AB and the bottom edge of the blade, line 
BC, and let 6 denote the angle between AB and the horizontal. Suppose that a piece of 
paper is cut in such a way that the center of the scissors at A is fixed, and the paper is also 
fixed. As the blades are closed (i.e., the angle 0 in the diagram is decreased), the distance x 
between A and C increases, cutting the paper. 


a. Express x in terms of a, 0, and £. 
b. Express dx/dt in terms of a, 0, 6, and d6/dt. 


c. Suppose that the distance a is 20 cm, and the angle is 5°. Further suppose that 0 
is decreasing at 50 deg/sec. At the instant when 6 = 30°, find the rate (in cm/sec) 
at which the paper is being cut. > 


Figure 6.2.9 Scissors. 


6.3 NEWTON’S METHOD 


Suppose you have a function f(x), and you want to find as accurately as possible where 
it crosses the x-axis; in other words, you want to solve f(x) = 0. Suppose you know of 
no way to find an exact solution by any algebraic procedure, but you are able to use an 
approximation, provided it can be made quite close to the true value. Newton’s method is 
a way to find a solution to the equation to as many decimal places as you want. It is what 
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is called an “iterative procedure,” meaning that it can be repeated again and again to 
get an answer of greater and greater accuracy. Iterative procedures like Newton’s method 
are well suited to programming for a computer. Newton’s method uses the fact that the 
tangent line to a curve is a good approximation to the curve near the point of tangency. 


EXAMPLE 6.3.1 Approximate V3. Since V3 is a solution to 2? = 3 or x? —3 = 0, we 
use f(x) = x” —3. We start by guessing something reasonably close to the true value; this 
is usually easy to do; let’s use V3 2. Now use the tangent line to the curve when x = 2 
as an approximation to the curve, as shown in figure 6.3.1. Since f’(x) = 2a, the slope of 
this tangent line is 4 and its equation is y = 4x — 7. The tangent line is quite close to f(x), 
so it crosses the x-axis near the point at which f(x) crosses, that is, near V3. It is easy 
to find where the tangent line crosses the x-axis: solve 0 = 4x — 7 to get = 7/4 = 1.75. 
This is certainly a better approximation than 2, but let us say not close enough. We can 
improve it by doing the same thing again: find the tangent line at x = 1.75, find where 
this new tangent line crosses the x-axis, and use that value as a better approximation. We 
can continue this indefinitely, though it gets a bit tedious. Lets see if we can shortcut the 
process. Suppose the best approximation to the intercept we have so far is x;. To find 
a better approximation we will always do the same thing: find the slope of the tangent 
line at x;, find the equation of the tangent line, find the x-intercept. The slope is 22;. 
The tangent line is y = (2a;)(a — 2;) + (a? — 3), using the point-slope formula for a line. 
Finally, the intercept is found by solving 0 = (2a;)(x —2;) + (2? —3). With a little algebra 
this turns into x = (#7 + 3)/(22;); this is the next approximation, which we naturally call 
x;41- Instead of doing the whole tangent line computation every time we can simply use 
this formula to get as many approximations as we want. Starting with xp = 2, we get 
a1 = (#2 +3)/(229) = (27 +3)/4 = 7/4 (the same approximation we got above, of course), 
vq = (2? +3)/(221) = ((7/4)? + 8)/(7/2) = 97/56 & 1.73214, x3 ~ 1.73205, and so on. 
This is still a bit tedious by hand, but with a calculator or, even better, a good computer 
program, it is quite easy to get many, many approximations. We might guess already that 
1.73205 is accurate to two decimal places, and in fact it turns out that it is accurate to 5 


places. 


Let’s think about this process in more general terms. We want to approximate a 
solution to f(x) = 0. We start with a rough guess, which we call xp. We use the tangent 
line to f(x) to get a new approximation that we hope will be closer to the true value. 
What is the equation of the tangent line when xz = xo? The slope is f’(xo) and the line 
goes through (xo, f(2o0)), so the equation of the line is 


y = f'(xo)(@ — to) + f (a0). 
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Figure 6.3.1 Newton’s method. (AP) 


Now we find where this crosses the x-axis by substituting y = 0 and solving for z: 


re mos (to) = 7 0). f (20) 


Lo 4 
f'(o) f'(xo) 
We will typically want to compute more than one of these improved approximations, so 
we number them consecutively; from x9 we have computed 71: 


ae tof"(to) — f(to) _ aon — £ (20) 
f'(o) © F"(@0)’ 
and in general from x; we compute 241: 
er vif'(vi) — f(a) Se f (zi) 
= f(a)  f'@ 


EXAMPLE 6.3.2 Returning to the previous example, f(x) = x” — 3, f’(x) = 2a, and 
the formula becomes 2;41 = 2; — (2? — 3)/(2a;) = (x? + 3)/(2a;), as before. 


In practice, which is to say, if you need to approximate a value in the course of 
designing a bridge or a building or an airframe, you will need to have some confidence that 
the approximation you settle on is accurate enough. As a rule of thumb, once a certain 
number of decimal places stop changing from one approximation to the next it is likely 
that those decimal places are correct. Still, this may not be enough assurance, in which 
case we can test the result for accuracy. 


EXAMPLE 6.3.3 Find the x coordinate of the intersection of the curves y = 2x and 
y = tan 2, accurate to three decimal places. To put this in the context of Newton’s method, 
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we note that we want to know where 2x = tanz or f(x) = tanxz — 2x = 0. We compute 
f'(x) = sec? x — 2 and set up the formula: 

tan 2; = 22; 

Se ee gece Li-2- 

From the graph in figure 6.3.2 we guess 79 = 1 as a starting point, then using the formula 
we compute x7, = 1.310478030, x2 = 1.223929096, x3 = 1.176050900, x4 = 1.165926508, 
Z5 = 1.165561636. So we guess that the first three places are correct, but that is not 
the same as saying 1.165 is correct to three decimal places—1.166 might be the correct, 
rounded approximation. How can we tell? We can substitute 1.165, 1.1655 and 1.166 into 
tan x — 2x; this gives —0.002483652, —0.000271247, 0.001948654. Since the first two are 
negative and the third is positive, tan x — 2x” crosses the x axis between 1.1655 and 1.166, 


so the correct value to three places is 1.166. 


15 4 In — 

10 4 10 4 

54 5 4 

0 ; 0 
i 1.5 1 1.5 


Figure 6.3.2. y=tanz and y = 22 on the left, y = tanx — 2 on the right. 


Exercises 6.3. 


You may want to use this Sage worksheet. 

1. Approximate the fifth root of 7, using xo = 1.5 as a first guess. Use Newton’s method to find 
x3 aS your approximation. => 

2. Use Newton’s Method to approximate the cube root of 10 to two decimal places. => 

3. The function f(r) = 2° — 32? — 34 + 6 has a root between 3 and 4, because f(3) = —3 and 
f(4) = 10. Approximate the root to two decimal places. > 

4. A rectangular piece of cardboard of dimensions 8 x 17 is used to make an open-top box 
by cutting out a small square of side x from each corner and bending up the sides. (See 
exercise 20 in 6.1.) If « = 2, then the volume of the box is 2. 4-13 = 104. Use Newton’s 
method to find a value of x for which the box has volume 100, accurate to 3 significant 
figures. > 
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6.4 LINEAR APPROXIMATIONS 


Newton’s method is one example of the usefulness of the tangent line as an approximation 
to a curve. Here we explore another such application. 

Recall that the tangent line to f(x) at a point x = a is given by L(x) = f’(a)(a—a)+ 
f(a). The tangent line in this context is also called the linear approximation to f at a. 

If f is differentiable at a then L is a good approximation of f so long as x is “not too 
far” from a. Put another way, if f is differentiable at a then under a microscope f will 
look very much like a straight line. Figure 6.4.1 shows a tangent line to y = x? at three 
different magnifications. 

If we want to approximate f(b), because computing it exactly is difficult, we can 
approximate the value using a linear approximation, provided that we can compute the 
tangent line at some a close to b. 


0.3704 


0.3604 


0.355 4 


0.3504 


T T T T 1 T T T T 1 
0.50 0.55 0.60 0.65 0.70 0.590 0.595 0.600 0.605 0.610 


Figure 6.4.1 The linear approximation to y = 2”. 


EXAMPLE 6.4.1 Let f(x) = Vx+4. Then f’(x) = 1/(2Vx+4). The linear ap- 
proximation to f at ¢ = 5 is L(x) = 1/(2V5+4)(2@ —5) + V5+-4 = (2 —5)/6+3. As 
an immediate application we can approximate square roots of numbers near 9 by hand. 
To estimate 10, we substitute 6 into the linear approximation instead of into f (x), so 
V6+4 (6—5)/6+3 = 19/6 = 3.16. This rounds to 3.17 while the square root of 10 is 
actually 3.16 to two decimal places, so this estimate is only accurate to one decimal place. 
This is not too surprising, as 10 is really not very close to 9; on the other hand, for many 


calculations, 3.2 would be accurate enough. 


With modern calculators and computing software it may not appear necessary to use 
linear approximations. But in fact they are quite useful. In cases requiring an explicit 
numerical approximation, they allow us to get a quick rough estimate which can be used 


as a “reality check” on a more complex calculation. In some complex calculations involving 
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functions, the linear approximation makes an otherwise intractable calculation possible, 
without serious loss of accuracy. 


EXAMPLE 6.4.2 Consider the trigonometric function sin x. Its linear approximation 
at x = 0 is simply L(x) = x. When z is small this is quite a good approximation and is 


used frequently by engineers and scientists to simplify some calculations. 


DEFINITION 6.4.3 Let y = f(x) be a differentiable function. We define a new 
independent variable dx, and a new dependent variable dy = f’(x) dx. Notice that dy is a 
function both of x (since f’() is a function of x) and of dx. We say that dx and dy are 


differentials. 


Let Ax = x —a and Ay = f(z) — f(a). If z is near a then Az is small. If we set 
dx = Az then 


A 
dy = f'(a) da © x he =a 


Thus, dy can be used to approximate Ay, the actual change in the function f between a 
and x. This is exactly the approximation given by the tangent line: 


dy = f'(a)(z — a) = f'(a)(@— a) + f(a) — f(a) = L(x) — f(a). 


While L(x) approximates f(x), dy approximates how f(a) has changed from f(a). Fig- 
ure 6.4.2 illustrates the relationships. 


Figure 6.4.2 Differentials. 
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Exercises 6.4. 


Let f(z) =a". If a=1 and dx = Ax = 1/2, what are Ay and dy? > 
Let f(x) = Vx. If a=1 and dx = Ax = 1/10, what are Ay and dy? > 
Let f(x) =sin(2x). Ifa =a and dx = Ax = 7/100, what are Ay and dy? => 


Use differentials to estimate the amount of paint needed to apply a coat of paint 0.02 cm 
thick to a sphere with diameter 40 meters. (Recall that the volume of a sphere of radius r is 
V = (4/3)rr*. Notice that you are given that dr = 0.0002.) > 


5. Show in detail that the linear approximation of sinz at « = 0 is L(x) = x and the linear 
approximation of cosxz at « = 0 is L(#) = 1. 


aa ae ea 


6.5 THE MEAN VALUE THEOREM 


Here are two interesting questions involving derivatives: 


1. Suppose two different functions have the same derivative; what can you say about 
the relationship between the two functions? 


2. Suppose you drive a car from toll booth on a toll road to another toll booth at 
an average speed of 70 miles per hour. What can be concluded about your actual 
speed during the trip? In particular, did you exceed the 65 mile per hour speed 
limit? 

While these sound very different, it turns out that the two problems are very closely 
related. We know that “speed” is really the derivative by a different name; let’s start by 
translating the second question into something that may be easier to visualize. Suppose 
that the function f(t) gives the position of your car on the toll road at time t. Your change 
in position between one toll booth and the next is given by f(t) — f(to), assuming that 
at time to you were at the first booth and at time ¢, you arrived at the second booth. 
Your average speed for the trip is (f(t1) — f(to))/(t1 — to). If we think about the graph 
of f(t), the average speed is the slope of the line that connects the two points (to, f(to)) 
and (t1, f(t1)). Your speed at any particular time t between to and t, is f’(t), the slope 
of the curve. Now question (2) becomes a question about slope. In particular, if the slope 
between endpoints is 70, what can be said of the slopes at points between the endpoints? 

As a general rule, when faced with a new problem it is often a good idea to examine 
one or more simplified versions of the problem, in the hope that this will lead to an 
understanding of the original problem. In this case, the problem in its “slope” form is 
somewhat easier to simplify than the original, but equivalent, problem. 

Here is a special instance of the problem. Suppose that f(to) = f(ti). Then the 
two endpoints have the same height and the slope of the line connecting the endpoints 
is zero. What can we say about the slope between the endpoints? It shouldn’t take 
much experimentation before you are convinced of the truth of this statement: Somewhere 
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between to and t, the slope is exactly zero, that is, somewhere between to and t, the slope 
is equal to the slope of the line between the endpoints. This suggests that perhaps the same 
is true even if the endpoints are at different heights, and again a bit of experimentation will 
probably convince you that this is so. But we can do better than “experimentation” —we 
can prove that this is so. 

We start with the simplified version: 


THEOREM 6.5.1 Rolle’s Theorem Suppose that f(z) 
interval (a,b), is continuous on the interval [a,b], and f(a) = f(b). Then at some value 
c € (a,b), f’(c) =0. 


has a derivative on the 


Proof. We know that f(x) has a maximum and minimum value on [a,b] (because it 
is continuous), and we also know that the maximum and minimum must occur at an 
endpoint, at a point at which the derivative is zero, or at a point where the derivative is 
undefined. Since the derivative is never undefined, that possibility is removed. 

If the maximum or minimum occurs at a point c, other than an endpoint, where 
f’(c) = 0, then we have found the point we seek. Otherwise, the maximum and minimum 
both occur at an endpoint, and since the endpoints have the same height, the maximum 
and minimum are the same. This means that f(x) = f(a) = f(b) at every x € [a,b], so 
the function is a horizontal line, and it has derivative zero everywhere in (a,b). Then we 
may choose any c at all to get f’(c) = 0. = 


Perhaps remarkably, this special case is all we need to prove the more general one as 


well. 


THEOREM 6.5.2 Mean Value Theorem Suppose that f(x) has a derivative on 
the interval (a,b) and is continuous on the interval [a,b]. Then at some value c € (a,b), 
b) — f(a 

—a 
f (0) — F(a) 
b-—a 
We know that g(x) has a derivative everywhere, since g’(a) = f’(a) —m. We can compute 
g(a) = f(a) — m(a — a) — f(a) = 0 and 


Proof. Let m= , and consider a new function g(x) = f(x)—m(x—a)-— f(a). 
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So the height of g(a) is the same at both endpoints. This means, by Rolle’s Theorem, that 
at some c, g/(c) = 0. But we know that g’(c) = f’(c) — m, so 


0=f(e —m= po - LO=9 
which turns into 
po = HOH), 
exactly what we want. a 


Returning to the original formulation of question (2), we see that if f(t) gives the 
position of your car at time t, then the Mean Value Theorem says that at some time c, 
f'(c) = 70, that is, at some time you must have been traveling at exactly your average 
speed for the trip, and that indeed you exceeded the speed limit. 

Now let’s return to question (1). Suppose, for example, that two functions are known to 
have derivative equal to 5 everywhere, f’(x”) = g’(a) = 5. It is easy to find such functions: 
52, 5a + 47, 54 — 132, etc. Are there other, more complicated, examples? No—the only 
functions that work are the “obvious” ones, namely, 5x plus some constant. How can we 
see that this is true? 

Although “5” is a very simple derivative, let’s look at an even simpler one. Suppose 
that f’(x) = g’/(x) = 0. Again we can find examples: f(x) = 0, f(x) = 47, f(x) = —-511 
all have f’(~) = 0. Are there non-constant functions f with derivative 0? No, and here’s 
why: Suppose that f(a) is not a constant function. This means that there are two points 
on the function with different heights, say f(a) # f(b). The Mean Value Theorem tells 
us that at some point c, f’(c) = (f(b) — f(a))/(b— a) #0. So any non-constant function 
does not have a derivative that is zero everywhere; this is the same as saying that the only 
functions with zero derivative are the constant functions. 

Let’s go back to the slightly less easy example: suppose that f’(x) = g'(x) = 5. Then 
(f(x) — g(a))’ = f'(x) — g(x) = 5—5 =0. So using what we discovered in the previous 
paragraph, we know that f(x) — g(x) =k, for some constant k. So any two functions with 
derivative 5 must differ by a constant; since 5x is known to work, the only other examples 
must look like 5a + k. 

Now we can extend this to more complicated functions, without any extra work. 
Suppose that f’(a) = g/(x). Then as before (f(x) — g(x))’ = f’(a) — g'(x) = 0, so 
f(x) —g(x) =k. Again this means that if we find just a single function g(x) with a certain 
derivative, then every other function with the same derivative must be of the form g(x) +k. 


EXAMPLE 6.5.3 Describe all functions that have derivative 5x — 3. It’s easy to find 
one: g(x) = (5/2)x” — 3x has g'(x) = 5a — 3. The only other functions with the same 
derivative are therefore of the form f(x) = (5/2)? — 3x2 +k. 
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Alternately, though not obviously, you might have first noticed that g(x) = (5/2)? — 
3a + 47 has g’(x) = 52 — 3. Then every other function with the same derivative must 
have the form f(x) = (5/2)”7 — 3a +47+k. This looks different, but it really isn’t. 
The functions of the form f(x) = (5/2)x? — 32 +k are exactly the same as the ones of 
the form f(x) = (5/2)x2 — 32 +47+k. For example, (5/2)x2? — 3x + 10 is the same as 
(5/2)x” — 3a +47 + (—37), and the first is of the first form while the second has the second 


form. 


This is worth calling a theorem: 


THEOREM 6.5.4 If f’(x) = g’(x) for every x € (a,b), then for some constant k, 
f(x) = g(x) + k on the interval (a, b). : 


EXAMPLE 6.5.5 Describe all functions with derivative sin + e”. One such function 
is —cosxz +e”, so all such functions have the form — cosxz + e” +k. 


Exercises 6.5. 
1. Let f(x) = x’. Find a value c € (—1,2) so that f’(c) equals the slope between the endpoints 
of f(x) on [-1,2]. => 
2. Verify that f(x) = x/(x + 2) satisfies the hypotheses of the Mean Value Theorem on the 


interval [1,4] and then find all of the values, c, that satisfy the conclusion of the theorem. 
=> 


3. Verify that f(x) = 32/(a + 7) satisfies the hypotheses of the Mean Value Theorem on the 
interval [—2, 6] and then find all of the values, c, that satisfy the conclusion of the theorem. 


A. Let f(z) = tanz. Show that f(a) = f(27) = 0 but there is no number c € (7, 277) such that 
f'(c) =0. Why does this not contradict Rolle’s theorem? 


5. Let f(x) = (« — 3)~”. Show that there is no value c € (1,4) such that f’(c) = (f(4) — 
f(1))/(4— 1). Why is this not a contradiction of the Mean Value Theorem? 


6. Describe all functions with derivative x” +47¢—-5.> 


1 
7. Describe all functions with derivative ———~. => 
1+ 2? 


1 

8. Describe all functions with derivative x? — —. > 
x 

9. Describe all functions with derivative sin(2x). => 


10. Show that the equation 6x* — 7z + 1 = 0 does not have more than two distinct real roots. 


11. Let f be differentiable on R. Suppose that f’(2) 4 0 for every x. Prove that f has at most 
one real root. 


12. Prove that for all real x and y | cosx — cos y| < |x — y|. State and prove an analogous result 
involving sine. 


13. Show that /I +a <1+(a/2) if-1<2<1. 


C 


Integration 


7.1 Two EXAMPLES 


Up to now we have been concerned with extracting information about how a function 
changes from the function itself. Given knowledge about an object’s position, for example, 
we want to know the object’s speed. Given information about the height of a curve we 
want to know its slope. We now consider problems that are, whether obviously or not, the 
reverse of such problems. 


EXAMPLE 7.1.1 An object moves in a straight line so that its speed at time t is given 
by v(t) = 3t in, say, cm/sec. If the object is at position 10 on the straight line when t = 0, 
where is the object at any time t? 

There are two reasonable ways to approach this problem. If s(t) is the position of the 
object at time t, we know that s’(t) = v(t). Because of our knowledge of derivatives, we 
know therefore that s(t) = 3t?/2+k, and because s(0) = 10 we easily discover that k = 10, 
so s(t) = 3t?/2+10. For example, at t = 1 the object is at position 3/2 +10 = 11.5. This 
is certainly the easiest way to deal with this problem. Not all similar problems are so easy, 
as we will see; the second approach to the problem is more difficult but also more general. 

We start by considering how we might approximate a solution. We know that at t = 0 
the object is at position 10. How might we approximate its position at, say, t = 1? We 
know that the speed of the object at time t = 0 is 0; if its speed were constant then in the 
first second the object would not move and its position would still be 10 when t = 1. In 
fact, the object will not be too far from 10 at t = 1, but certainly we can do better. Let’s 
look at the times 0.1, 0.2, 0.38, ..., 1.0, and try approximating the location of the object 
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at each, by supposing that during each tenth of a second the object is going at a constant 
speed. Since the object initially has speed 0, we again suppose it maintains this speed, but 
only for a tenth of second; during that time the object would not move. During the tenth 
of a second from t = 0.1 to t = 0.2, we suppose that the object is traveling at 0.3 cm/sec, 
namely, its actual speed at t = 0.1. In this case the object would travel (0.3)(0.1) = 0.03 
centimeters: 0.3 cm/sec times 0.1 seconds. Similarly, between t = 0.2 and t = 0.3 the 
object would travel (0.6)(0.1) = 0.06 centimeters. Continuing, we get as an approximation 
that the object travels 


(0.0)(0.1) + (0.3)(0.1) + (0.6)(0.1) +++» + (2.7)(0.1) = 1.35 


centimeters, ending up at position 11.35. This is a better approximation than 10, certainly, 
but is still just an approximation. (We know in fact that the object ends up at position 
11.5, because we’ve already done the problem using the first approach.) Presumably, 
we will get a better approximation if we divide the time into one hundred intervals of a 
hundredth of a second each, and repeat the process: 


(0.0)(0.01) + (0.03) (0.01) + (0.06)(0.01) + +--+ (2.97)(0.01) = 1.485. 


We thus approximate the position as 11.485. Since we know the exact answer, we can see 
that this is much closer, but if we did not already know the answer, we wouldn’t really 
know how close. 

We can keep this up, but we’ll never really know the exact answer if we simply compute 
more and more examples. Let’s instead look at a “typical” approximation. Suppose we 
divide the time into n equal intervals, and imagine that on each of these the object travels 
at a constant speed. Over the first time interval we approximate the distance traveled 
as (0.0)(1/n) = 0, as before. During the second time interval, from t = 1/n to t = 2/n, 
the object travels approximately 3(1/n)(1/n) = 3/n? centimeters. During time interval 
number i, the object travels approximately (3(i — 1)/n)(1/n) = 3(é — 1)/n? centimeters, 
that is, its speed at time (7 — 1)/n, 3(¢ — 1)/n, times the length of time interval number i, 
1/n. Adding these up as before, we approximate the distance traveled as 
(0) $35 +3(2)5 +3(38)5 Baie ce Ns 
centimeters. What can we say about this? At first it looks rather less useful than the 
concrete calculations we’ve already done. But in fact a bit of algebra reveals it to be much 
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more useful. We can factor out a 3 and 1/n? to get 


OUTS at ad), 


that is, 3/n? times the sum of the first n — 1 positive integers. Now we make use of a fact 


you may have run across before: 


k(k + 1) 


ae ee leer, 


In our case we’re interested in k = n — 1, so 


ee faery ee ea) ae 


This simplifies the approximate distance traveled to 


3n?-n 3n?-n_ 3 n? n\ 3 1 1 
n? 2 2 n2 2\n? n2/] 2 nj 
Now this is quite easy to understand: as n gets larger and larger this approximation gets 


closer and closer to (3/2)(1 — 0) = 3/2, so that 3/2 is the exact distance traveled during 
one second, and the final position is 11.5. 


So for t = 1, at least, this rather cumbersome approach gives the same answer as 
the first approach. But really there’s nothing special about t = 1; let’s just call it t¢ 
instead. In this case the approximate distance traveled during time interval number 7 is 
3(i — 1)(t/n)(t/n) = 3(4 — 1)t?/n?, that is, speed 3(4 — 1)(t/n) times time t/n, and the 
total distance traveled is approximately 


0) +3) 4 +32) +32) 4+ +32 Ye 
n n? n2 n2 n2- 


As before we can simplify this to 


3t? 3t? n2—n st (1 ~) 


In the limit, as n gets larger, this gets closer and closer to (3/2)t? and the approximated 
position of the object gets closer and closer to (3/2)t? + 10, so the actual position is 


(3/2)t? + 10, exactly the answer given by the first approach to the problem. 


EXAMPLE 7.1.2 Find the area under the curve y = 3x between x = 0 and any positive 
value x. There is here no obvious analogue to the first approach in the previous example, 
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but the second approach works fine. (Because the function y = 3z is so simple, there 
is another approach that works here, but it is even more limited in potential application 
than is approach number one.) How might we approximate the desired area? We know 
how to compute areas of rectangles, so we approximate the area by rectangles. Jumping 
straight to the general case, suppose we divide the interval between 0 and x into n equal 
subintervals, and use a rectangle above each subinterval to approximate the area under 
the curve. There are many ways we might do this, but let’s use the height of the curve 
at the left endpoint of the subinterval as the height of the rectangle, as in figure 7.1.1. 
The height of rectangle number 7 is then 3(7 — 1)(a/n), the width is x/n, and the area is 
3(i — 1)(a?/n?). The total area of the rectangles is 


(0) = +30)5 +302)55 +3(8)55 iE anc Sn i) 


By factoring out 327/n? this simplifies to 


3x7 Sa7 nn? —-n 68 1 
—_(0+14+2+--- -—1))= S97 Vis); 
“a | +1+4+2+---+(n—-1)) 7 5 5° ( = 


As n gets larger this gets closer and closer to 327/2, which must therefore be the true area 


under the curve. 


Figure 7.1.1 Approximating the area under y = 3a with rectangles. Drag the slider to 
change the number of rectangles. 


What you will have noticed, of course, is that while the problem in the second example 
appears to be much different than the problem in the first example, and while the easy 
approach to problem one does not appear to apply to problem two, the “approximation” 
approach works in both, and moreover the calculations are identical. As we will see, there 
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are many, many problems that appear much different on the surface but that turn out to 
be the same as these problems, in the sense that when we try to approximate solutions we 
end up with mathematics that looks like the two examples, though of course the function 
involved will not always be so simple. 

Even better, we now see that while the second problem did not appear to be amenable 
to approach one, it can in fact be solved in the same way. The reasoning is this: we know 
that problem one can be solved easily by finding a function whose derivative is 3t. We 
also know that mathematically the two problems are the same, because both can be solved 
by taking a limit of a sum, and the sums are identical. Therefore, we don’t really need 
to compute the limit of either sum because we know that we will get the same answer by 
computing a function with the derivative 3t or, which is the same thing, 3x. 

It’s true that the first problem had the added complication of the “10”, and we certainly 
need to be able to deal with such minor variations, but that turns out to be quite simple. 
The lesson then is this: whenever we can solve a problem by taking the limit of a sum of 
a certain form, we can instead of computing the (often nasty) limit find a new function 


with a certain derivative. 


Exercises 7.1. 
1. Suppose an object moves in a straight line so that its speed at time t is given by u(t) = 2t+2, 
and that at t = 1 the object is at position 5. Find the position of the object at t = 2. > 


2. Suppose an object moves in a straight line so that its speed at time t is given by u(t) = es 
and that at t = 0 the object is at position 5. Find the position of the object at t= 2. > 


3. By a method similar to that in example 7.1.2, find the area under y = 2x between x = 0 and 
any positive value for 7. > 


4. By a method similar to that in example 7.1.2, find the area under y = 4x between x = 0 and 
any positive value for 7. > 


5. By a method similar to that in example 7.1.2, find the area under y = 4a between x = 2 and 
any positive value for x bigger than 2. > 


6. By a method similar to that in example 7.1.2, find the area under y = 4x between any two 
positive values for x, say a <b. > 


7. Let f(x) = x” + 3a +2. Approximate the area under the curve between x = 0 and x = 2 
using 4 rectangles and also using 8 rectangles. > 


8. Let f(x) = a — 2a +3. Approximate the area under the curve between x = 1 and x = 3 
using 4 rectangles. => 


7.2 THE FUNDAMENTAL THEOREM OF CALCULUS 


Let’s recast the first example from the previous section. Suppose that the speed of the 
object is 3t at time t. How far does the object travel between time t = a and time t = b? 
We are no longer assuming that we know where the object is at time t = 0 or at any other 
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time. It is certainly true that it is somewhere, so let’s suppose that at t = 0 the position is k. 
Then just as in the example, we know that the position of the object at any time is 3t?/2+k. 
This means that at time t = a the position is 3a7/2 +k and at time t = b the position is 
3b7/2 +k. Therefore the change in position is 3b?/2 + k — (3a7/2 +k) = 3b7/2 — 3a?/2. 
Notice that the k drops out; this means that it doesn’t matter that we don’t know k, it 
doesn’t even matter if we use the wrong k, we get the correct answer. In other words, to 
find the change in position between time a and time b we can use any antiderivative of the 
speed function 3t—it need not be the one antiderivative that actually gives the location of 
the object. 

What about the second approach to this problem, in the new form? We now want to 
approximate the change in position between time a and time b. We take the interval of 
time between a and ), divide it into n subintervals, and approximate the distance traveled 
during each. The starting time of subinterval number 7 is now a + (7 — 1)(b— a)/n, which 
we abbreviate as t;_1, so that t) = a, t, = a+(b-—a)/n, and so on. The speed of the 
object is f(t) = 3t, and each subinterval is (b — a)/n = At seconds long. The distance 
traveled during subinterval number 7 is approximately f(t;-1)At, and the total change in 
distance is approximately 


f(to)At + f(ti)At+ +--+ f(tn—1)At. 


The exact change in position is the limit of this sum as n goes to infinity. We abbreviate 


this sum using sigma notation: 


S- f(tiJAt = f(to)At t+ f(tsJAt+ +--+ f(tn—1)At. 


The notation on the left side of the equal sign uses a large capital sigma, a Greek letter, 
and the left side is an abbreviation for the right side. The answer we seek is 


n-1 
ae At 


Since this must be the same as the answer we have already obtained, we know that 


The significance of 3t?/2, into which we substitute t = b and t = a, is of course that it is 
a function whose derivative is f(t). As we have discussed, by the time we know that we 
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want to compute 
n—-1 


it no longer matters what f(t) stands for—it could be a speed, or the height of a curve, 
or something else entirely. We know that the limit can be computed by finding any 
function with derivative f(t), substituting a and b, and subtracting. We summarize this 
in a theorem. First, we introduce some new notation and terms. 


We write 
b n—-1 
f(t)dt= ey el 


if the limit exists. That is, the left hand side means, or is an abbreviation for, the right 
hand side. The symbol f{ is called an integral sign, and the whole expression is read 
as “the integral of f(t) from a to b.”. What we have learned is that this integral can be 
computed by finding a function, say F(t), with the property that F’(t) = f(t), and then 
computing F'(b) — F(a). The function F(t) is called an antiderivative of f(t). Now the 
theorem: 


THEOREM 7.2.1 Fundamental Theorem of Calculus Suppose that f(x) is 
continuous on the interval |{a, b]. If F(a) is any antiderivative of f(a), then 


b 
i Ade HO) Re). 


Let’s rewrite this slightly: 


[ Oe eee son 


We’ve replaced the variable x by t and b by x. These are just different names for quantities, 
so the substitution doesn’t change the meaning. It does make it easier to think of the two 
sides of the equation as functions. The expression 


/ “f(t at 


is a function: plug in a value for x, get out some other value. The expression F(x) — F(a) 
is of course also a function, and it has a nice property: 


£ (F(0) ~ F(a) = F(a) = f(x), 
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since F(a) is a constant and has derivative zero. In other words, by shifting our point of 
view slightly, we see that the odd looking function 


= [ feat 


has a derivative, and that in fact G’(x) = f(x). This is really just a restatement of the 
Fundamental Theorem of Calculus, and indeed is often called the Fundamental Theorem 
of Calculus. To avoid confusion, some people call the two versions of the theorem “The 
Fundamental Theorem of Calculus, part I” and “The Fundamental Theorem of Calculus, 
part IT”, although unfortunately there is no universal agreement as to which is part I and 
which part II. Since it really is the same theorem, differently stated, some people simply 
call them both “The Fundamental Theorem of Calculus.” 


THEOREM 7.2.2 Fundamental Theorem of Calculus Suppose that f(z) is 
continuous on the interval [a, b] and let 
=f soa 


We have not really proved the Fundamental Theorem. In a nutshell, we gave the 


Then G’(x) = f(a). 


following argument to justify it: Suppose we want to know the value of 


We can interpret the right hand side as the distance traveled by an object whose speed 
is given by f(t). We know another way to compute the answer to such a problem: find 
the position of the object by finding an antiderivative of f(t), then substitute t = a and 
t = b and subtract to find the distance traveled. This must be the answer to the original 
problem as well, even if f(t) does not represent a speed. 

What’s wrong with this? In some sense, nothing. As a practical matter it is a very 
convincing argument, because our understanding of the relationship between speed and 
distance seems to be quite solid. From the point of view of mathematics, however, it 
is unsatisfactory to justify a purely mathematical relationship by appealing to our un- 
derstanding of the physical universe, which could, however unlikely it is in this case, be 
wrong. 

A complete proof is a bit too involved to include here, but we will indicate how it goes. 
First, if we can prove the second version of the Fundamental Theorem, theorem 7.2.2, then 
we can prove the first version from that: 
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Proof of Theorem 7.2.1. | We know from theorem 7.2.2 that 


Gi) = [ f(t) dt 


is an antiderivative of f(x), and therefore any antiderivative F(x) of f(x) is of the form 
F(z) = G(«) +k. Then 


F(b) — F(a) = G(b) +k — (G(a) +k) = G(b) — (a) 


= [ soa- [soar 


It is not hard to see that / f(t) dt = 0, so this means that 


a 


b 
F(b) — F(a) = / f(t) dt, 


which is exactly what theorem 7.2.1 says. = 


So the real job is to prove theorem 7.2.2. We will sketch the proof, using some facts 
that we do not prove. First, the following identity is true of integrals: 


[roa froas [sou 


This can be proved directly from the definition of the integral, that is, using the limits 
of sums. It is quite easy to see that it must be true by thinking of either of the two 
applications of integrals that we have seen. It turns out that the identity is true no matter 
what c is, but it is easiest to think about the meaning when a < c < b. 

First, if f(t) represents a speed, then we know that the three integrals represent the 
distance traveled between time a and time 6; the distance traveled between time a and 
time c; and the distance traveled between time c and time b. Clearly the sum of the latter 
two is equal to the first of these. 

Second, if f(t) represents the height of a curve, the three integrals represent the area 
under the curve between a and b; the area under the curve between a and c; and the area 
under the curve between c and b. Again it is clear from the geometry that the first is equal 
to the sum of the second and third. 
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Proof sketch for Theorem 7.2.2. | We want to compute G’(x), so we start with the 
definition of the derivative in terms of a limit: 


G(x) 


| 


; 1 at+Aa z 
~ arto Az ( i iyat = | F(t) «) 

: 1 v r+Az 
ee (/ F(t) art f f(t) arf FO) «) 


1 ctAz 
lim — / f(t) dt. 


Az>0 Ax 


Now we need to know something about 


[ mre (8) dt 


when Az is small; in fact, it is very close to Axf(x), but we will not prove this. Once 
again, it is easy to believe this is true by thinking of our two applications: The integral 


| a f(t) dt 


can be interpreted as the distance traveled by an object over a very short interval of time. 
Over a sufficiently short period of time, the speed of the object will not change very much, 
so the distance traveled will be approximately the length of time multiplied by the speed at 
the beginning of the interval, namely, Ax f(x). Alternately, the integral may be interpreted 
as the area under the curve between x and x + Az. When Az is very small, this will be 
very close to the area of the rectangle with base Az and height f(x); again this is Ar f(x). 
If we accept this, we may proceed: 


r+Aa 
m af : fwiad= lim BEINE) gay 


li 
Azc>0 Ax Azx—-0 Av 


which is what we wanted to show. | 


It is still true that we are depending on an interpretation of the integral to justify the 
argument, but we have isolated this part of the argument into two facts that are not too 
hard to prove. Once the last reference to interpretation has been removed from the proofs 
of these facts, we will have a real proof of the Fundamental Theorem. 


7.2 The Fundamental Theorem of Calculus 157 


Now we know that to solve certain kinds of problems, those that lead to a sum of a 
certain form, we “merely” find an antiderivative and substitute two values and subtract. 
Unfortunately, finding antiderivatives can be quite difficult. While there are a small number 
of rules that allow us to compute the derivative of any common function, there are no such 
rules for antiderivatives. There are some techniques that frequently prove useful, but we 
will never be able to reduce the problem to a completely mechanical process. 

Because of the close relationship between an integral and an antiderivative, the integral 
sign is also used to mean “antiderivative”. You can tell which is intended by whether the 


2 
| xv? dx 
1 


is an ordinary integral, also called a definite integral, because it has a definite value, 


2 oS 4s 
93 13 7 
2 
pee ee egy 
[ta=5 rae 


[eu 


to denote the antiderivative of x, also called an indefinite integral. So this is evaluated 


limits of integration are included: 


namely 


We use 


as 
3 
[ean-tre 


It is customary to include the constant C’ to indicate that there are really an infinite 
number of antiderivatives. We do not need this C to compute definite integrals, but in 
other circumstances we will need to remember that the C’ is there, so it is best to get 
into the habit of writing the C. When we compute a definite integral, we first find an 
antiderivative and then substitute. It is convenient to first display the antiderivative and 
then do the substitution; we need a notation indicating that the substitution is yet to be 
done. A typical solution would look like this: 


2 3 
/ eee = 
i 3 


The vertical line with subscript and superscript is used to indicate the operation “substitute 


2 


Ge Berg 


and subtract” that is needed to finish the evaluation. 
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Exercises 7.2. 


Find the antiderivatives of the functions: 


21. Find the derivative of G(x) 


22. Find the derivative of G(x) = 


1. 8/r => 2. 3P +1 = 
3. 4//z > 4 Die 
5. Ts = 6. (5r+1)? > 
7. (c-—6)/ > 8. 2/2 > 
2 
9. == 10. |2t¢-—4| > 
Ov | | 
Compute the values of the integrals: 
4 Tw 
11. / t? + 3tdt > 12. y sint dt > 
1 0 
10 4 5 
13. / ie 14. [ ea 
i; 0 
3 2 
15. [ a dx > 16. / a dx > 
0 1 
17. Find the derivative of G(x) = i # —3tdt > 
1 
a2 
18. Find the derivative of G(x) = / i? —3tdi => 
1 
19. Find the derivative of G(x) = / e dt > 
i 
a2 
2 
20. Find the derivative of G(x) = / e dt => 
1 
I 
I 


7.3 SOME PROPERTIES OF INTEGRALS 


Suppose an object moves so that its speed, or more properly velocity, is given by u(t) = 
—t? + 5t, as shown in figure 7.3.1. Let’s examine the motion of this object carefully. 
We know that the velocity is the derivative of position, so position is given by s(t) = 
—t?/3 + 5t?/2 + C. Let’s suppose that at time t = 0 the object is at position 0, so 
s(t) = —t?/3 + 5t?/2; this function is also pictured in figure 7.3.1. 

Between t = 0 and t = 5 the velocity is positive, so the object moves away from the 
starting point, until it is a bit past position 20. Then the velocity becomes negative and 
the object moves back toward its starting point. The position of the object at t = 5 is 
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Figure 7.3.1 The velocity of an object and its position. 


exactly s(5) = 125/6, and at t = 6 it is s(6) = 18. The total distance traveled by the 
object is therefore 125/6 + (125/6 — 18) = 71/3 © 23.7. 
As we have seen, we can also compute distance traveled with an integral; let’s try it. 


6 : -8 5.,|° 
| oat = | —t? + 5tdt = ——+—-t?| =18. 
) 0 3 2 Io 
What went wrong? Well, nothing really, except that it’s not really true after all that “we 
can also compute distance traveled with an integral”. Instead, as you might guess from this 
example, the integral actually computes the net distance traveled, that is, the difference 
between the starting and ending point. 
As we have already seen, 


[ow dt = [ww dt + [ow dt. 


Computing the two integrals on the right (do it!) gives 125/6 and —17/6, and the sum of 
these is indeed 18. But what does that negative sign mean? It means precisely what you 
might think: it means that the object moves backwards. To get the total distance traveled 
we can add 125/6 + 17/6 = 71/3, the same answer we got before. 

Remember that we can also interpret an integral as measuring an area, but now we 
see that this too is a little more complicated that we have suspected. The area under the 
curve u(t) from 0 to 5 is given by 


and the “area” from 5 to 6 is 
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In other words, the area between the x-axis and the curve, but under the z-axis, “counts 


6 
| w(i)de = 18 
0 


measures “net area”, the area above the axis minus the (positive) area below the axis. 


as negative area”. So the integral 


If we recall that the integral is the limit of a certain kind of sum, this behavior is not 
surprising. Recall the sort of sum involved: 


n-1 
S| u(t) At. 
1=0 


In each term v(t) At the At is positive, but if u(t;) is negative then the term is negative. If 
over an entire interval, like 5 to 6, the function is always negative, then the entire sum is 
negative. In terms of area, uv(t)At is then a negative height times a positive width, giving 
a negative rectangle “area”. 


So now we see that when evaluating 


by finding an antiderivative, substituting, and subtracting, we get a surprising answer, but 
one that turns out to make sense. 
Let’s now try something a bit different: 


5 


5 43 Bee 
t) dt = — —t 
[ow +3 


ae ae ae tee” ae 8 


Here we simply interchanged the limits 5 and 6, so of course when we substitute and 
subtract we’re subtracting in the opposite order and we end up multiplying the answer 
by —1. This too makes sense in terms of the underlying sum, though it takes a bit more 
thought. Recall that in the sum 


n-1 
S- o(tiAt, 
1=0 


the At is the “length” of each little subinterval, but more precisely we could say that 
At = t;41 —t;, the difference between two endpoints of a subinterval. We have until now 
assumed that we were working left to right, but could as well number the subintervals from 
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right to left, so that tp) = b and t, =a. Then At = t,41 — 1; is negative and in 


the values u(t;) are negative but also At is negative, so all terms are positive again. On 
the other hand, in 


the values u(t;) are positive but At is negative,and we get a negative result: 


0) 3 0 3 
at 5 —5 5 

[ e@a= +30 Srna a eee 
5 3 2 


3 2 


5 


Finally we note one simple property of integrals: 


if f(x) + g(x) dz = [5 de fate) dx. 


This is easy to understand once you recall that (F(x) + G(x))’ = F’(#) + G’ (x). Hence, if 
F'(x) = f(x) and G’(x) = g(x), then 
b 
| f@)+o(e)ae = (F@) +6) 


= F(b) + G(b) — F(a) — G(a) 
= F(b) — F(a) + G(b) — G(a) 
= F(x)|, + G(x)\ 


= [ seaes foe) 


In summary, we will frequently use these properties of integrals: 


[serar= [sears [ soyae 


[serroear= [sears [ eae 


[s@e=- fie 
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and if a< band f(x) <0 on [a,}] then 


and in fact 


Exercises 7.3. 


1. 


An object moves so that its velocity at time t is v(t) = —9.8t+ 20 m/s. Describe the motion 
of the object between ¢ = 0 and t = 5, find the total distance traveled by the object during 
that time, and find the net distance traveled. => 


An object moves so that its velocity at time t is v(t) = sint. Set up and evaluate a single 
definite integral to compute the net distance traveled between t = 0 and t = 27. > 


An object moves so that its velocity at time ¢ is v(t) = 1+2sint m/s. Find the net distance 
traveled by the object between ¢ = 0 and t = 27, and find the total distance traveled during 
the same period. > 


Consider the function f(x) = (x + 2)(x + 1)(a — 1)(# — 2) on [—2,2]. Find the total area 
between the curve and the x-axis (measuring all area as positive). = 


Consider the function f(#) = x? — 3a +2 on [0,4]. Find the total area between the curve 
and the x-axis (measuring all area as positive). > 


Evaluate the three integrals: 


3 4 3 
A= | (—a? +9) dx B= | (—a? + 9) dx c= | (—a” + 9) dx, 
0 0 4 


and verify that A= B+C. => 


8 


Techniques of Integration 


Over the next few sections we examine some techniques that are frequently successful when 
seeking antiderivatives of functions. Sometimes this is a simple problem, since it will be 
apparent that the function you wish to integrate is a derivative in some straightforward 


jae dx 


we realize immediately that the derivative of x‘’ will supply an 2'°: (a'')’ = 11r'°. We 
don’t want the “11”, but constants are easy to alter, because differentiation “ignores” them 


way. For example, faced with 


in certain circumstances, so 


d 1 11 1 10 10 
oe ae oll et Sai WO 10 
Gea to. 


From our knowledge of derivatives, we can immediately write down a number of an- 


tiderivatives. Here is a list of those most often used: 


grtl 
eS if —1 
fe dx pekqe ia 
[ota =inle|+e 
[ea=ere 
[suede =-cose + 
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[cosede = sine +C 
[scndr =tane+C 
[sccotane dr = sex +€ 


1 
/up# =arctanz+C 
1+ 2? 


1 
————. dx = arcsinz +C 
/ V1 — x? 


8.1 SUBSTITUTION 


Needless to say, most problems we encounter will not be so simple. Here’s a slightly more 
complicated example: find 


je cos(a*) dx. 


This is not a “simple” derivative, but a little thought reveals that it must have come from 
an application of the chain rule. Multiplied on the “outside” is 27, which is the derivative 
of the “inside” function x7. Checking: 


sO 


je cos(a*) dx = sin(x”) + C. 


Even when the chain rule has “produced” a certain derivative, it is not always easy to 
see. Consider this problem: 


[ov — x? dx. 


There are two factors in this expression, x? and \/1 — 22, but it is not apparent that the 


chain rule is involved. Some clever rearrangement reveals that it is: 
1 
fev —g° dr = [or (-5)] (1 = (1 —27))s/1 — 2? de. 


This looks messy, but we do now have something that looks like the result of the chain 
rule: the function 1 — 2? has been substituted into —(1/2)(1 — x)/z, and the derivative 
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of 1 — a”, —2x, multiplied on the outside. If we can find a function F(x) whose derivative 
is —(1/2)(1 — x) /x we'll be done, since then 


© Fel) = oP ae) 9) (-3) AC) ore 


dx 
= 2/1 — 2x? 


But this isn’t hard: 
d 1, a2 3/2 
—5(1 — 2) Ve de = —5@ —x°!/*) dx (8.1.1) 


— 12 372 2 572 
= 5 (5 5? +C 


1 1 


So finally we have 


[evi-Paz = (Fa —27")- 5) (1 —27)9/2 40. 


5 


So we succeeded, but it required a clever first step, rewriting the original function so 
that it looked like the result of using the chain rule. Fortunately, there is a technique that 
makes such problems simpler, without requiring cleverness to rewrite a function in just the 
right way. It does sometimes not work, or may require more than one attempt, but the 
idea is simple: guess at the most likely candidate for the “inside function”, then do some 
algebra to see what this requires the rest of the function to look like. 

One frequently good guess is any complicated expression inside a square root, so we 
start by trying u = 1 — x”, using a new variable, u, for convenience in the manipulations 
that follow. Now we know that the chain rule will multiply by the derivative of this inner 


function: 
du 
= —22, 


a 


so we need to rewrite the original function to include this: 


Recall that one benefit of the Leibniz notation is that it often turns out that what looks 


like ordinary arithmetic gives the correct answer, even if something more complicated is 
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going on. For example, in Leibniz notation the chain rule is 


dy _ dy dt 


dx dt dx’ 
The same is true of our current expression: 
ie de = i © Vidas 
Now we're almost there: since u = 1 — 2”, 7 = 1 — u and the integral is 
1 
[30 —u)/udu. 


It’s no coincidence that this is exactly the integral we computed in (8.1.1), we have simply 
renamed the variable u to make the calculations less confusing. Just as before: 


: ee 
[30 wyadu= (Zu 3) +C. 


Then since u = 1— 27: 


[oevi-Paz = (50 —2)- 5) G27 7A ac 


To summarize: if we suspect that a given function is the derivative of another via the 
chain rule, we let w denote a likely candidate for the inner function, then translate the 
given function so that it is written entirely in terms of u, with no x remaining in the 
expression. If we can integrate this new function of u, then the antiderivative of the 
original function is obtained by replacing u by the equivalent expression in x. 

Even in simple cases you may prefer to use this mechanical procedure, since it often 
helps to avoid silly mistakes. For example, consider again this simple problem: 


jr cos(a*) dx. 


Let u = x”, then du/dx = 2x or du = 2x dx. Since we have exactly 2x dx in the original 
integral, we can replace it by du: 


je cos(x”) dx = [cosudu =sinu+C = sin(z”) +C. 


This is not the only way to do the algebra, and typically there are many paths to the 
correct answer. Another possibility, for example, is: Since du/dx = 2x, dx = du/2x, and 
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then the integral becomes 


| 2xc0s(a?) dr = [2xcosu = f cosudu. 


The important thing to remember is that you must eliminate all instances of the original 
variable x. 


EXAMPLE 8.1.1 Evaluate [lox +b)" dx, assuming that a and 6 are constants, a # 0, 


and n is a positive integer. We let u = ax + b so du = adz or dx = du/a. Then 


1 1 1 
"dg = | —u™ du = ———u""! — brett ©, 
[torso dx i du Gre +C oat ae be 


EXAMPLE 8.1.2 Evaluate i sin(axz + b) dx, assuming that a and 6 are constants and 


a #0. Again we let u= ax +b so du =adz or dx = du/a. Then 


1 1 1 
[snlar +0 des / 7 Sinudu = qt os) +C= ae cos(az +b) +C. 


4 
EXAMPLE 8.1.3 Evaluate | x sin(x”) dx. First we compute the antiderivative, then 
2 
evaluate the definite integral. Let u = 2? so du = 2x dx or xdx = du/2. Then 
1 1 1 
[ esint? = i 5 sinudu = ae cosu) + C = ae cos(x”) + C. 


Now 


1 4 


4 
i ax sin(a*) dx = —~cos(x?) 
9 2 


A somewhat neater alternative to this method is to change the original limits to match 


1 1 
= 5 cos(16) + 5 cos(4). 


2 


the variable u. Since u = x”, when x = 2, u = 4, and when x = 4, u = 16. So we can do 
this: 
16 


1 1 
aes cos(16) + —cos(4). 


4 16 1 1 
/ vsin(a?) de = | 3 Sinudu = 5 (cos u) 5 


An incorrect, and dangerous, alternative is something like this: 
4 


4 


4 


1 1 
= —<cos(x?)| = as cos(16) + 5 cos(4). 


4 44 1 
ip x sin(x”) dx = | — sinudu = —= cos(u) 
2 ee 2 


2 2 


4 
This is incorrect because i, 5 sin udu means that u takes on values between 2 and 4, which 
2 


4 


and forget 
2 


1 
is wrong. It is dangerous, because it is very easy to get to the point oe cos(u) 
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1 1 
to substitute 2? back in for u, thus getting the incorrect answer = cos(4) + 5 cos(2). A 


somewhat clumsy, but acceptable, alternative is something like this: 


ae if = cos(16) — cos(4) 


| x sin(«”) dx = i —sinudu = —=cos(u) 
2 - 2 2 


=2 


1/2 r 
EXAMPLE 8.1.4 Evaluate / a) 
1/4 sin*(mt) 


du/m = cos(rt) dt. We change the limits to sin(1/4) = V2/2 and sin(1/2) = 1. Then 
ue t ome esl | Lut 
[eae [same [a tqu= 24 

1/4 Sin (rt) J2/2 7 U J2/2 7 av =1 


Exercises 8.1. 


dt. Let u = sin(at) so du = mcos(nt) dt or 


1 if V2 


V2/2 T T 


Find the antiderivatives or evaluate the definite integral in each problem. 


1; [o-ora => 2. [+P ae => 
1 
3. a(x? +1)? de > 4. /a==4 
/ V1 — 5t 
5. [sw xcosx dx => 6. [ev 100 — x? dz > 
T: / _ dx => 8. [ costae cos(sin(mt)) dL = 
9. out dz => 10. tanadz > 
cos? x 
11. i sin? (3a) cos(3a) dx > 12. [ sec" ctanzdxz => 
0 
Va/2 
13. | a sec’ (a7) tan(x”) dx > 14, / a) dz => 
0 cos? x 
4 n/6 
1 
15. | Gn—72 dg => 16. i (cos? x — sin? x) dx > 
3. 0 
17 a ai = 18 (2x° — 1)(a* — 22)° dx > 
, (22 — 7)1/9 © a 


19. [ sin’ edz > 20. [fore da => 


1 
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8.2 POWERS OF SINE AND COSINE 


Functions consisting of products of the sine and cosine can be integrated by using substi- 
tution and trigonometric identities. These can sometimes be tedious, but the technique is 
straightforward. Some examples will suffice to explain the approach. 


EXAMPLE 8.2.1 Evaluate i sin? x dx. Rewrite the function: 


[sods = [sve sin* «dx = [ snetsin? x)? dx = [snea — cos? x)? dx. 
Now use u = cosx, du = — sina dz: 


[sinz( costa) = f 0 - wd 
= f -Q-2? +u!)du 


2 1 
= Seu = te eC 


2 1 
=—cosx+ 3 cost — = cosa +C. 


EXAMPLE 8.2.2 Evaluate [sn vde. Use sin? 2 = (1 — cos(2x))/2 to rewrite the 


= 3 
[sw2ds fin?ayrar = [ LSE? ap 


1 
a aE — 3cos 2x + 3cos? 2x — cos® 2x dr. 


function: 


Now we have four integrals to evaluate: 
i Hs Boe 


3 
[300820 de — = sin 2x” 


and 
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are easy. The cos? 22 integral is like the previous example: 


/ — cos? 2x7 dx = J — cos 2x cos” 2x dx 


1 
-/- 3 
1 u3 
46-9) 

1/. sin? 2x 
5 (sinze - 3 ): 


And finally we use another trigonometric identity, cos” 2 = (1 + cos(2x))/2: 


1 A 3 in 4 
[30ost2ear=3 f =S ae =F (24 *). 


So at long last we get 
3 it in? 2 3 in4 
[sviods et 7 —_ 1g nee - T6 (sin2x - — *) + 16 («+ ==) +C. 


EXAMPLE 8.2.3 Evaluate [sw x cos’ «da. Use the formulas sin? = (1—cos(22)) /2 


— cos 2a(1 — sin? 2x) dx 


= @) du 


and cos” x = (1+ cos(2x))/2 to get: 


1—- 2 1 2 
[ six? wos? rade -/ cost v) . feos v) dx. 


The remainder is left as an exercise. 


Exercises 8.2. 
Find the antiderivatives. 


sin? xdz > 2. sin? «dz > 


. 4 2 
sin xcdxz > 4. cos’ xsin° xdx => 


. 2 2 
sin* xcos’ xdx => 


3... 2 3/2 
cos’ xsin® xdx => 8. / 


sin z(cosx)°/* dx => 


sec” «csc? «dx => 10. tan® xsecadx > 


/ 
/ 
5. [cost edz — 6. 
/ 
; 


Pc ie is ee 
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8.3 TRIGONOMETRIC SUBSTITUTIONS 


So far we have seen that it sometimes helps to replace a subexpression of a function by 
a single variable. Occasionally it can help to replace the original variable by something 
more complicated. This seems like a “reverse” substitution, but it is really no different in 
principle than ordinary substitution. 


EXAMPLE 8.3.1 Evaluate / V1—22 dz. Let x =sinu so dx = cosudu. Then 


[vin Bae | Vi sin wcosudu= f Ve0s* 00s udu. 


We would like to replace Vcos? u by cosu, but this is valid only if cos u is positive, since 
V cos? u is positive. Consider again the substitution xz = sin u. We could just as well think 
of this as u = arcsin x. If we do, then by the definition of the arcsine, —7/2 < u < 1/2, so 
cosu > 0. Then we continue: 


1 2 in2 
[veo icosudu =f cos?udu= fF gu = 24 Esc 


arcsinx  sin(2arcsinx 
ee ok ( r ) cr 


This is a perfectly good answer, though the term sin(2 arcsin x) is a bit unpleasant. It is 
possible to simplify this. Using the identity sin22 = 2sinxcosx, we can write sin2u = 


2sin ucosu = 2sin(arcsin x) V1 — sin? u = 2xy/1 —sin?(arcsina) = 22\/1 — x?. Then the 


full antiderivative is 


arcsinzn 2¢/1l—22~ arcsinzg «V1—2? 
5 + m _ 5 + 5 +C. 


This type of substitution is usually indicated when the function you wish to integrate 
contains a polynomial expression that might allow you to use the fundamental identity 


2 


sin? x + cos? x = 1 in one of three forms: 


cos? = 1—- sin? x sec? ¢ =1+tan? a tan? « = sec? x — 1. 
If your function contains 1—”, as in the example above, try « = sin u; if it contains 1+ 2? 
try « = tanu; and if it contains x? — 1, try = secu. Sometimes you will need to try 
something a bit different to handle constants other than one. 
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EXAMPLE 8.3.2 Evaluate / V4 — 92? dx. We start by rewriting this so that it looks 


more like the previous example: 


| Vi- 98 ax = f JAA = BaP) dx = f 2/1 Be /2P? ae. 


Now let 37/2 = sinu so (3/2) dx = cosudu or dx = (2/3) cosudu. Then 


4 
[2vi=GepPae= f2 1 — sin? u (2/3) cosudu = ;_| cos? udu 


_ 4u Asin2u 


C 
6 ar 12 = 
2 in(3dx/2 2 si 
= arcsin(3x/2) , 2sinucosu |G 
3 3 
2arcsin(3xz/2)  2sin(arcsin(3x/2)) cos(arcsin(32/2)) 
= + a 
3 3 
2 in( 3x /2 2(3x/2),/1 — (32/2)? 
_ arcsin( w/2) | (32/2) (32/2) LC 
3 3 
2 in( 3a /2 4 — 9x? 
_ ares e/2) Fi ea : be 16, 


using some of the work from example 8.3.1. 


EXAMPLE 8.3.3 Evaluate Jv 1+22dz. Let « =tanu, dx = sec” udu, so 


[Vivace = | i+ tan? usec? udu= f VsecF usec? udu. 
Since u = arctan(x), —7/2 <u < 7/2 and secu > 0, so Vsec? u = secu. Then 


/ Vsec2 usec? udu = | sec? u du. 


In problems of this type, two integrals come up frequently: - sec? udu and J secudu. 


Both have relatively nice expressions but they are a bit tricky to discover. 
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First we do { sec udu, which we will need to compute ‘ sec? u du: 
secu + tan u 
secudu = | secu ——— —— du 
secu + tan u 


sec? u + sec utan u 
secu + tan u 


Now let w = secu+tanu, dw = secutanu + sec? udu, exactly the numerator of the 
function we are integrating. Thus 


2 

t 1 

[sccudu= [= poe! at du = [ dw =In|w|+C 
secu + tan u Ww 


=In|secu+tanu|+C. 


Now for / sec? u du: 


3 secbu sec?u— sec?u—s (tan? u + 1) secu 
sec’ u = - = Si 
2 2 2 2 
_ sec? u a sec utan? u 4, Secu sec? u + sec u tan? u , secu 
— 2 20. 2 2 


We already know how to integrate sec u, so we just need the first quotient. This is “simply” 
a matter of recognizing the product rule in action: 


if sec? u + secu tan? udu = sec utan u. 


So putting these together we get 


t ] t 
[setudu= secutant n|secu + tan u| ns) 
2 2 
and reverting to the original variable x: 
[vivea o sec u tan wu 4 In| secu + tan u| 4C 
2 2 

_ sec(arctan x) tan(arctan x) * In | sec(arctan x) + tan(arctan x)| We 

7 2 2 

aa a In|V1+27+4+2| LC 

“2 2 


using tan(arctan x) = x and sec(arctan x) = Vl + tan(arctanz) = /1+4 2?. 
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Exercises 8.3. 


Find the antiderivatives. 


1. [scxae > 2. [ost nde > 
3. [Ve 1az > A, [ Vora de = 
5. [evi-Bar = 6. [evi-ea = 


1 
7. ———— dr > 8. [Ve+ 2dr = 
lx 


1 x 
=n ms It => 10. dx => 
ces : la= 
Jax / xv 
11. dx => 12. ———. dz > 
J/1l—2 V4r2 — 1 


8.4 INTEGRATION BY PARTS 


We have already seen that recognizing the product rule can be useful, when we noticed 
that 

[sc u+secutan? udu = sec utanu. 
As with substitution, we do not have to rely on insight or cleverness to discover such 


antiderivatives; there is a technique that will often help to uncover the product rule. 
Start with the product rule: 


+ fla)ale) = F@)al2) + F(2)o'(o) 


We can rewrite this as 
fag(e) = f F@g(e)ax+ | fle)g'(e) ax, 
and then 


i, fay Ga faye) / I (a)g(e) de. 


This may not seem particularly useful at first glance, but it turns out that in many cases 
we have an integral of the form 


/ f(a)g! (a) de 
but that 
fl fi(a)g(x) de 


is easier. This technique for turning one integral into another is called integration by 
parts, and is usually written in more compact form. If we let u = f(x) and v = g(x) then 
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du = f’(a) dx and dv = g'(x) dx and 
[ude =w~ fvdu. 
To use this technique we need to identify likely candidates for u = f(x) and dv = g’(x) dz. 


EXAMPLE 8.4.1 Evaluate [emeae. Let u = Inz so du = 1/xdz. Then we must 


let dv = xdzx so v = x7/2 and 


[einede= "8 — [2 a= fae S 4 
a 


2 2 


EXAMPLE 8.4.2 Evaluate [ esinvac. Let wu = x so du = dx. Then we must let 


dv = sinx dz so v = —cosx and 


[esinzde = -neos2— [ -cosedr=-noose+ f cosrdx = —reosz+ sine + 


EXAMPLE 8.4.3 Evaluate i: sec? x dx. Of course we already know the answer to this, 


but we needed to be clever to discover it. Here we’ll use the new technique to discover the 
antiderivative. Let u = seca and dv = sec? edz. Then du = secxtanxdzx and v = tanz 
and 


[scfedr=secatane— f tan? xsecxde 
=secatana — f (sec? e — 1) sec ade 


=secatans— f sec’rde+ f secsrdz, 
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At first this looks useless—we’re right back to / sec? x dx. But looking more closely: 


[scknde=secatans— f sec®rde+ f secs dz 


[schnae+ secre = seertane + | secw dr 


2 f sec? rdr=seortane + [seo dr 


t 1 
sec «dx = =n" F [sccode 
2 2 
secxtanz In|secx+tanz 
=a | 5 Lec. 


EXAMPLE 8.4.4 Evaluate ia sina dx. Let u = x”, dv = sinx dz; then du = 2x dx 


and v = —cosz. Now fe sina dx = —x* cosx + jr cosxdz. This is better than the 


original integral, but we need to do integration by parts again. Let u = 2x, dv = cosx dz; 
then du = 2 and v = sin, and 


[Psinede =-2? cose + f 2xcosedr 


= —7? cosx + 2x sina — [esineae 


= —x7* cosx + 2rsinz +2cosx+C. 


Such repeated use of integration by parts is fairly common, but it can be a bit tedious to 
accomplish, and it is easy to make errors, especially sign errors involving the subtraction in 
the formula. There is a nice tabular method to accomplish the calculation that minimizes 
the chance for error and speeds up the whole process. We illustrate with the previous 
example. Here is the table: 


or 
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To form the first table, we start with wu at the top of the second column and repeatedly 
compute the derivative; starting with dv at the top of the third column, we repeatedly 
compute the antiderivative. In the first column, we place a “—” in every second row. To 
form the second table we combine the first and second columns by ignoring the boundary; 


14 


if you do this by hand, you may simply start with two columns and add a “—” to every 
second row. 

To compute with this second table we begin at the top. Multiply the first entry in 
column u by the second entry in column dv to get —x? cos z, and add this to the integral 


of the product of the second entry in column u and second entry in column dv. This gives: 
—x* cosx + / 2x cos x dx, 


or exactly the result of the first application of integration by parts. Since this integral is 
not yet easy, we return to the table. Now we multiply twice on the diagonal, (x2*)(— cos 2) 
and (—2x)(— sin x) and then once straight across, (2)(— sin x), and combine these as 


—x* cosa + 2x sinx — [esincas, 


giving the same result as the second application of integration by parts. While this integral 
is easy, we may return yet once more to the table. Now multiply three times on the diagonal 
to get (x*)(—cosx), (—2xr)(—sinx), and (2)(cosx), and once straight across, (0)(cos x). 
We combine these as before to get 


-a? cost + 2rsine +2eosx+ f 0dr = —r* cosx + 2rsinx+2cosr+C. 


Typically we would fill in the table one line at a time, until the “straight across” multipli- 
cation gives an easy integral. If we can see that the wu column will eventually become zero, 
we can instead fill in the whole table; computing the products as indicated will then give 
the entire integral, including the “+C'”, as above. 


Exercises 8.4. 


Find the antiderivatives. 


1. [ vcosnde = 2. [cosede = 


3. [ee dx > A. [oor dx > 


5. [sear > 6. [ocde = 
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3 


7. [ varctan ede => 8. fe sinx dx => 

9. fe cosxdx => 10. [esn? xdx => 
11. [ esinecosde => 12. [ sxetan(vi) ac => 
13. [suiva) aS 14. [ sec! acsc’ «dx > 


8.5 RATIONAL FUNCTIONS 


A rational function is a fraction with polynomials in the numerator and denominator. 
For example, 
= 1 g? +1 
x? +4 —6' (x — 3)?’ x? — 1? 


are all rational functions of x. There is a general technique called “partial fractions” 
that, in principle, allows us to integrate any rational function. The algebraic steps in the 
technique are rather cumbersome if the polynomial in the denominator has degree more 
than 2, and the technique requires that we factor the denominator, something that is not 
always possible. However, in practice one does not often run across rational functions with 
high degree polynomials in the denominator for which one has to find the antiderivative 
function. So we shall explain how to find the antiderivative of a rational function only 
when the denominator is a quadratic polynomial ax? + ba + c. 

We should mention a special type of rational function that we already know how to 
integrate: If the denominator has the form (az + b)”, the substitution u = ax + b will 
always work. The denominator becomes u”, and each x in the numerator is replaced by 
(u — b)/a, and dx = du/a. While it may be tedious to complete the integration if the 
numerator has high degree, it is merely a matter of algebra. 
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3 


= dx. Using the substitution u = 3 — 2x we get 


x 
EXAMPLE 8.5.1 Find — 
= } (3 — 22) 


3 
3 uns en ae 7 
/ x eke wa 2) u== {> Que + 27u Zee 


(3 — 2x) —2 uw 16 uP 
1 
= 16 i OG OTE = OTE al 
= 1 fu7! 9u-? is Q7u-2 = 27u-4 LC 
~ 16 \ -1 =—2 3 ee 
i (GSI 0G —27)-. 23. 27 a) 
_ 1 (@=22)? _ 9(3-20)-? | 27(8-20)® _ 27(8-2a)-*) |, 
16 —1 —2 -3 —4 
1 9 9 27 


ma = C 
163-22) | 328-22) 16(3—22)3 | 643—22)) + 


We now proceed to the case in which the denominator is a quadratic polynomial. We 
can always factor out the coefficient of x? and put it outside the integral, so we can assume 
that the denominator has the form x? + br +c. There are three possible cases, depending 
on how the quadratic factors: either x? + br + ¢ = (x — r)(x — s), x7 +ba +e= (x—1)?, 


or it doesn’t factor. We can use the quadratic formula to decide which of these we have, 
and to factor the quadratic if it is possible. 


EXAMPLE 8.5.2 Determine whether x? + 2 +1 factors, and factor it if possible. The 
quadratic formula tells us that «7 + « + 1 = 0 when 


-l+vl1-4 
on 
2 


Since there is no square root of —3, this quadratic does not factor. 


EXAMPLE 8.5.3 Determine whether x? — x — 1 factors, and factor it if possible. The 
quadratic formula tells us that x? — x — 1 = 0 when 


14V14+4 145 
i = : 
D 2 


GH -X-1L= [| r— 5) Lo 5) : 


Therefore 
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If x? + be +c = (x — rr)? then we have the special case we have already seen, that can 
be handled with a substitution. The other two cases require different approaches. 
If x? + br +c = (x#—r)(x — 8), we have an integral of the form 


fener” 


where p(x) is a polynomial. The first step is to make sure that p(a) has degree less than 
2: 


Ana 


(x — 2)(a + 3) 
that has degree less than 2. To do this we use long division of polynomials to discover that 


dx in terms of an integral with a numerator 


EXAMPLE 8.5.4 Rewrite 


x? x? iba 7x —6 4: 7x —6 
=> = 20-— SSeS SS ES 5 
(c—2)(v1+3) a2%+a2-6 x2+a—6 (x — 2)(x + 3) 


| (c— ne a [e ae +f (x See i 


The first integral is easy, so only the second requires some work. 


sO 


Now consider the following simple algebra of fractions: 


A nm B Ag=s)>Ba=T) (Ap Bie Ae Ber 


g—-r 2¢—-s (#—r)(ex—s) — (x —r)(x — 8) 


That is, adding two fractions with constant numerator and denominators (~—r) and (x—s) 
produces a fraction with denominator (x — r)(x — s) and a polynomial of degree less than 
2 for the numerator. We want to reverse this process: starting with a single fraction, we 
want to write it as a sum of two simpler fractions. An example should make it clear how 


to proceed. 
EXAMPLE 8.5.5 Evaluat / ai dx. We start by writi ie 
5. valuate xz. We star writin 
(e—2)(«+3) ¥ © (c—2)(a +3) 
as the sum of two fractions. We want to end up with 
7z —6 A 5 


(x — 2)(x +3) rae BS 


If we go ahead and add the fractions on the right hand side we get 
7x —6 (A+ B)x+3A-—2B 


(x — 2)(x + 3) (x — 2)(a +3) 


So all we need to do is find A and B so that 77 — 6 = (A+ B)x + 3A — 2B, which is to 
say, we need 7 = A+ B and —6 = 3A — 2B. This is a problem you’ve seen before: solve a 
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system of two equations in two unknowns. There are many ways to proceed; here’s one: If 
7 = A+B then B = 7—A and so —6 = 83A—-2B = 3A—2(7-A) = 3A—1442A = 5A-14. 
This is easy to solve for A: A = 8/5, and then B=7—-— A=7-8/5 = 27/5. Thus 
7x —6 8 1 4 ae 8 27 
di = dz = =|In|x -—2/+ —1 3] + C. 
erences " [erst eg Sea ele 


The answer to the original problem is now 


leary n= fx—rde+ f E-Hea i 


2 


8 Zi 
Sage oh 5 nla —2|+ — nie +3|+C. 


Now suppose that x? + bz +c doesn’t factor. Again we can use long division to ensure 
that the numerator has degree less than 2, then we complete the square. 


oe 
x? +4748 
factor. We could complete the square and use a trigonometric substitution, but it is simpler 


EXAMPLE 8.5.6 Evaluate i dx. The quadratic denominator does not 


to rearrange the integrand: 


jose - / Ee - ees * 
e2+4er4+8 ° Jf 22+4r4+8 e2+4¢4+8 °° 


The first integral is an easy substitution problem, using u = x? + 4x + 8: 


xr+2 1 fdu 1 
dx = = —]n |x? + 4a + 8]. 
[= 2 a u 2 mee eee 


For the second integral we complete the square: 
2 
2 
g?t+47+8=(£42)?+4=4 (3 ) 1) 


making the integral 


2 
Using u = —- we get 


1 1 1 2 1 xr+2 
1 Te. dx = 1 po = 9 arctan ae 
Co ae! ss 


The final answer is now 


1 1 1 2 
f eRe pine 4 te +8] — 5 arctan (72) +C. 
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Exercises 8.5. 


Find the antiderivatives. 
1. — dx => 2 ee 

Ao 

4 — 4-2 

2 

x 
=> 4 —_dz > 
4-2 - 
1 


dx 
x? wet 25 


>) 


| 
see a 
5. f pads . fsa”? 
ie © fri 
|maem 2 


7. 


x 
dx => 
x? v2 +102 + 21 


7 
t 


9. dx => 10 3 = 


2 
2 
sie x24+3 


8.6 NUMERICAL INTEGRATION 


We have now seen some of the most generally useful methods for discovering antiderivatives, 
and there are others. Unfortunately, some functions have no simple antiderivatives; in such 
cases if the value of a definite integral is needed it will have to be approximated. We will 
see two methods that work reasonably well and yet are fairly simple; in some cases more 
sophisticated techniques will be needed. 

Of course, we already know one way to approximate an integral: if we think of the 
integral as computing an area, we can add up the areas of some rectangles. While this 
is quite simple, it is usually the case that a large number of rectangles is needed to get 
acceptable accuracy. A similar approach is much better: we approximate the area under a 
curve over a small interval as the area of a trapezoid. In figure 8.6.1 we see an area under 
a curve approximated by rectangles and by trapezoids; it is apparent that the trapezoids 
give a substantially better approximation on each subinterval. 


Figure 8.6.1 Approximating an area with rectangles and with trapezoids. 


As with rectangles, we divide the interval into n equal subintervals of length Ax. A 


typical trapezoid is pictured in figure 8.6.2; it has area L(x) + f(t) Ax. If we add up 
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the areas of all trapezoids we get 


Foo) + Fler), , Her) + Fle) 9, , Hen + Flom) 9 
2 2 2 
(AS + Fea) + flea) +--+ flea) + 2S") Ae 


This is usually known as the Trapezoid Rule. For a modest number of subintervals this 
is not too difficult to do with a calculator; a computer can easily do many subintervals. 


(xi, f(w:)) 


Figure 8.6.2 A single trapezoid. 


In practice, an approximation is useful only if we know how accurate it is; for example, 
we might need a particular value accurate to three decimal places. When we compute a 
particular approximation to an integral, the error is the difference between the approxi- 
mation and the true value of the integral. For any approximation technique, we need an 
error estimate, a value that is guaranteed to be larger than the actual error. If A is an 
approximation and F is the associated error estimate, then we know that the true value 
of the integral is between A— E and A+ EL. In the case of our approximation of the 
integral, we want & = E(Az) to be a function of Az that gets small rapidly as Az gets 
small. Fortunately, for many functions, there is such an error estimate associated with the 


trapezoid approximation. 


THEOREM 8.6.1 Suppose f has a second derivative f’” everywhere on the interval 
[a,b], and |f’’(x)| < M for all x in the interval. With Az = (b—a)/n, an error estimate 
for the trapezoid approximation is 


E(Ac) = 74 M (Ae) ae 


Let’s see how we can use this. 
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1 
EXAMPLE 8.6.2 Approximate i e~* dx to two decimal places. The second deriva- 
0 


tive of f = e~® is (4a? =e; and it is not hard to see that on {0, 1], (402 —2)e-” | <2, 
We begin by estimating the number of subintervals we are likely to need. To get two dec- 
imal places of accuracy, we will certainly need E(Ax) < 0.005 or 


(2) < 0.005 
12." n2 ; 


1 
g (200) <n? 


100 


With n = 6, the error estimate is thus 1/6° < 0.0047. We compute the trapezoid approxi- 
mation for six intervals: 
( + f(1/6) + (2/6) +--+ + (5/6) + ”| 5 © 0.74512. 

So the true value of the integral is between 0.74512 — 0.0047 = 0.74042 and 0.74512 + 
0.0047 = 0.74982. Unfortunately, the first rounds to 0.74 and the second rounds to 0.75, 
so we can’t be sure of the correct value in the second decimal place; we need to pick a larger 
n. As it turns out, we need to go to n = 12 to get two bounds that both round to the same 
value, which turns out to be 0.75. For comparison, using 12 rectangles to approximate 
the area gives 0.7727, which is considerably less accurate than the approximation using six 
trapezoids. 

In practice it generally pays to start by requiring better than the maximum possible 
error; for example, we might have initially required E(Az) < 0.001, or 


1 1 
—(2)— < 0.001 
ia! 2p = 


i 
g (1000) <n? 


12.91 = ~ <n 


Had we immediately tried n = 13 this would have given us the desired answer. 


The trapezoid approximation works well, especially compared to rectangles, because 
the tops of the trapezoids form a reasonably good approximation to the curve when Az is 
fairly small. We can extend this idea: what if we try to approximate the curve more closely, 
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by using something other than a straight line? The obvious candidate is a parabola: if we 
can approximate a short piece of the curve with a parabola with equation y = ax?+bx+c, 
we can easily compute the area under the parabola. 

There are an infinite number of parabolas through any two given points, but only 
one through three given points. If we find a parabola through three consecutive points 
(xi, f(vi)), (Gi4a, f(Vi41)), (Cite, f(Li42)) on the curve, it should be quite close to the 
curve over the whole interval |x;,7;+2], as in figure 8.6.3. If we divide the interval [a, 5] 
into an even number of subintervals, we can then approximate the curve by a sequence of 
parabolas, each covering two of the subintervals. For this to be practical, we would like a 
simple formula for the area under one parabola, namely, the parabola through (2;, f(z;)), 
(ti+1, f(ai41)), and (x42, f(vi+2)). That is, we should attempt to write down the parabola 
y = ax? + bx +c through these points and then integrate it, and hope that the result is 
fairly simple. Although the algebra involved is messy, this turns out to be possible. The 
algebra is well within the capability of a good computer algebra system like Sage, so we 
will present the result without all of the algebra; you can see how to do it in this Sage 
worksheet. 

To find the parabola, we solve these three equations for a, b, and c: 

f (as) = a(ai41 — Ax)? + b(ai41 — Av) +e 
f (igi) = a(@i41)? + O(wi41) + 
f(zin2) = (ey, + Ax)? + (aj. + Az) +e 


Not surprisingly, the solutions turn out to be quite messy. Nevertheless, Sage can easily 
compute and simplify the integral to get 


ri41tAaz Ag 
/ ax? + br +cdz = gz (F (ts) + AF (@i41) + f(i+2)). 
vi41—-Ax 


Now the sum of the areas under all parabolas is 


AE (£00) +f 1) + F(a) + F (a2) +4 f(e8) + f(a) + “+ f(tn—2) +4 f (@n—-1) + f(@n)) = 


AX (f(20) + Af (21) + 2f(@2) + 4f (wa) + 2f (wa) +--+ + 2f(@n—-2) + 4f(tn-1) + f(en))- 


| 
This is just slightly more complicated than the formula for trapezoids; we need to remember 
the alternating 2 and 4 coefficients; note that n must be even for this to make sense. This 
approximation technique is referred to as Simpson’s Rule. 
As with the trapezoid method, this is useful only with an error estimate: 
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(rite, f(i42)) 


i it it2 
Figure 8.6.3. A parabola (dashed) approximating a curve (solid). 


THEOREM 8.6.3 Suppose f has a fourth derivative f( everywhere on the interval 
[a,b], and |f“(x)| < M for all x in the interval. With Ax = (b — a)/n, an error estimate 


for Simpson’s approximation is 


b-a 


Ne) Tg 


M(Az)* = ~——— 


1 
EXAMPLE 8.6.4 Let us again approximate i e~® dx to two decimal places. The 
0 


fourth derivative of f = e~* is (16a? — 48x? + 12)e"*’; on [0,1] this is at most 12 in 
absolute value. We begin by estimating the number of subintervals we are likely to need. 
To get two decimal places of accuracy, we will certainly need E(Azx) < 0.005, but taking 
a cue from our earlier example, let’s require E(Az) < 0.001: 


: (12) d < 0.001 
180 n4 : 
200 ‘i 
ee 
3 
200 
2.362 "4 aa 2 


So we try n = 4, since we need an even number of subintervals. Then the error estimate 
is 12/180/4* < 0.0003 and the approximation is 


il 


~ 0.746855. 
3-4 


(F(0) + 4f (1/4) + 2f(1/2) + 4f(3/4) + f1)) 


So the true value of the integral is between 0.746855 — 0.0003 = 0.746555 and 0.746855 + 
0.0003 = 0.7471555, both of which round to 0.75. 
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Exercises 8.6. 


In the following problems, compute the trapezoid and Simpson approximations using 4 subin- 
tervals, and compute the error estimate for each. (Finding the maximum values of the second 
and fourth derivatives can be challenging for some of these; you may use a graphing calculator 
or computer software to estimate the maximum values.) If you have access to Sage or similar 
software, approximate each integral to two decimal places. You can use this Sage worksheet to 
get started. 


3 3 
1. / xcdx => 2. i x dx > 
1 ) 


4 3 1 
3. | ae dx => 4. 7 —_dz=> 
2 1 & 


1 
6. [ sviFeae = 
0 
Br is 1 
7. / dz => 8. [ ve ¥iae = 
1 0 
1 4 
9. [ vet¥iae = 10. [ vi¥ifear > 
0 1 


(| 
a, 
a 
+ 
re 
Q 
8 
s 


11. Using Simpson’s rule on a parabola f(x), even with just two subintervals, gives the exact value 
of the integral, because the parabolas used to approximate f will be f itself. Remarkably, 
Simpson’s rule also computes the integral of a cubic function f(#) = ax* + ba? + cx +d 
exactly. Show this is true by showing that 


i f(a) da = >" (F (wo) + 4f((wo + x2)/2) + f(a2)). 


3:2 


This does require a bit of messy algebra, so you may prefer to use Sage. 


8.7 ADDITIONAL EXERCISES 


These problems require the techniques of this chapter, and are in no particular order. Some 
problems may be done in more than one way. 


1. [erora = 2; pe — 9)? dt => 
3. [ie + 16)te” dt > A, [ sintcos 2tdt > 
2t+1 
5. [ tantsec? tat = 6. Jatue 
?+¢t+3 
vf a—pte 8. | amt 
; t(t? — 4) : (25 — #2)3/2 
cos 3t 2 
9. dt > 10. [ tsec tdt > 
Vsin 3t 


11 12. [costtat = 


[Rue 
a Pad 
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1 1 
13. ——_ d= 14. —____ dt => 
fans Pe 
15 _ sect 16. | #/#2+1dt> 
; (1 + tan t)8 ; 
17. Je sintdt > 18. (er + 47)? Vt dt > 
19 / a dt > 20 / i dt => 
' J Q—P)572 " J 49 + 4t?) 
arctan 2t t 
DA cee |g 22. —_____dt= 
i 1+ 4t? faa 
23. [ six? toos! tat = 24. /—am dt => 
#2 —6t+9 
25. lupe = 26. [taneeat > 
t(In t)? 
fa 
O77. jee dt > 28. i; + dt > 
PATA 
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Applications of Integration 


9.1 AREA BETWEEN CURVES 


We have seen how integration can be used to find an area between a curve and the z-axis. 
With very little change we can find some areas between curves; indeed, the area between 
a curve and the x-axis may be interpreted as the area between the curve and a second 
“curve” with equation y = 0. In the simplest of cases, the idea is quite easy to understand. 


EXAMPLE 9.1.1 Find the area below f(x) = —a? + 42 + 3 and above g(x) = —x* + 
7x? —10x +5 over the interval 1 < x < 2. In figure 9.1.1 we show the two curves together, 
with the desired area shaded, then f alone with the area under f shaded, and then g alone 
with the area under g shaded. 


Y y Y 
A A A 
10 5 10 + 10 + 
| 
| 
5 5 54 
| I | | 
| I | | 
| I | | 
| i | | 
| I | | 
| I | | 
0+ >X O >xX 0 > x 
1 2 3 0 1 2 3 0 1 2 3 


Figure 9.1.1 Area between curves as a difference of areas. 
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It is clear from the figure that the area we want is the area under f minus the area 


under g, which is to say 


[ terae— f aeyae= fi se) Ca ia 


It doesn’t matter whether we compute the two integrals on the left and then subtract or 
compute the single integral on the right. In this case, the latter is perhaps a bit easier: 


2 2 
| F(x) — g(z) de = | —a? + 4¢+3— (—2° + 7x? — 102 +5) dx 
1 1 


2 
-| xv? — 8a? + 142 — 2dax 
1 


2 


a 8a" 9 
= Ss —2 

i g + @ an 

16 64 1 8 
= — — — + 28-4-—(—--—-=4+7-2 

. aes ae ae 
es 

¢ 4 12 


It is worth examining this problem a bit more. We have seen one way to look at it, 
by viewing the desired area as a big area minus a small area, which leads naturally to the 
difference between two integrals. But it is instructive to consider how we might find the 
desired area directly. We can approximate the area by dividing the area into thin sections 
and approximating the area of each section by a rectangle, as indicated in figure 9.1.2. 
The area of a typical rectangle is Ax(f(x;) — g(2;)), so the total area is approximately 


This is exactly the sort of sum that turns into an integral in the limit, namely the integral 


fF 402)- 92) 


Of course, this is the integral we actually computed above, but we have now arrived at it 
directly rather than as a modification of the difference between two other integrals. In that 
example it really doesn’t matter which approach we take, but in some cases this second 


approach is better. 
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10 - 


0 | 
@) 1 2 3 


Figure 9.1.2 Approximating area between curves with rectangles. 


EXAMPLE 9.1.2 Find the area below f(a) = —2? + 4x + 1 and above g(a) = —a? + 
7x? — 10x +3 over the interval 1 < x < 2; these are the same curves as before but lowered 
by 2. In figure 9.1.3 we show the two curves together. Note that the lower curve now dips 
below the x-axis. This makes it somewhat tricky to view the desired area as a big area 
minus a smaller area, but it is just as easy as before to think of approximating the area 
by rectangles. The height of a typical rectangle will still be f(x;) — g(x;), even if g(x;) is 
negative. Thus the area is 


2 P) 
/ -a 4 dn +1 (-0° +727 = 100 +3)de= fa — 82? + Ue - 2de, 
1 1 


This is of course the same integral as before, because the region between the curves is 


identical to the former region—it has just been moved down by 2. 


Figure 9.1.3 Area between curves. 


EXAMPLE 9.1.3 Find the area between f(x) = —x? 4+ 4x and g(x) = «* — 6a +5 over 
the interval 0 < x < 1; the curves are shown in figure 9.1.4. Generally we should interpret 
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“area” in the usual sense, as a necessarily positive quantity. Since the two curves cross, 
we need to compute two areas and add them. First we find the intersection point of the 


curves: 
—7? +47 = 27? —6r4+5 


0 =2r7-10r+5 


7 10+ V100—40 54+ V15 
a 4 — 2° 


The intersection point we want is x = a = (5 — V15)/2. Then the total area is 


a 1 
i: 2 6045—(-2?+40)de+ | —a? + 4a — (a? —62 +5) dz 
0) a 


1 


[202-1004 5a0+ f —27? +102 —5dax 
0 a 


a 1 


203 203 

= — ~522452| + —-— +52? — 5a 
3 0 3 a 
52 

=-> +5v15, 


after a bit of simplification. 


Figure 9.1.4 Area between curves that cross. 


EXAMPLE 9.1.4 Find the area between f(x) = —x? + 4x and g(x) = x” — 6x +5; 
the curves are shown in figure 9.1.5. Here we are not given a specific interval, so it must 
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be the case that there is a “natural” region involved. Since the curves are both parabolas, 


the only reasonable interpretation is the region between the two intersection points, which 


we found in the previous example: 


5 V15 
5. * 


If we let a = (5 — V15)/2 and b = (5 + V15)/2, the total area is 


b b 
[ -a?+42-(@- 6245) de= | —2¢" + 10x —5dzx 


9 3 
== + 5x? — 5a 
3 a 
= dv 15. 
after a bit of simplification. 
3) 
0 
1 5 
=F = 


Figure 9.1.5 Area bounded by two curves. 


Exercises 9.1. 


Find the area bounded by the curves. 


1. 


= 
= 


oe a eo ee 


y=u'—2 andy=2" (the part to the right of the y-axis) > 
c=y anz=y’=> 

g=1-y’ and y= -x-1=> 

z=3y—-y° andx+y=3> 

y = cos(ra/2) and y = 1 — 2” (in the first quadrant) > 

y = sin(ra#/3) and y = x (in the first quadrant) > 

y = Vz and y = —_— 

y=Veandy=Vez+1,0<24<45 
e=Oandr=25-ys 


y=sinxcosx andy=singz,0<x*%<7> 
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11. y=a? and y= 27/3 > 
12. y=2?—-2¢ and y=r#-2> 


The following three exercises expand on the geometric interpretation of the hyperbolic functions. 
Refer to section 4.11 and particularly to figure 4.11.2 and exercise 6 in section 4.11. 
13. Compute / Vx? —1dz using the substitution u = arccoshx, or x = coshu; use exercise 6 


in section 4.11. 


14. Fix t > 0. Sketch the region R in the right half plane bounded by the curves y = x tanht, 
y = —xtanht, and x? — y? = 1. Note well: t is fixed, the plane is the x-y plane. 


15. Prove that the area of R is t. 


9.2 DISTANCE, VELOCITY, ACCELERATION 
We next recall a general principle that will later be applied to distance-velocity-acceleration 


b 
problems, among other things. If F’(u) is an anti-derivative of f(u), then / f(u) du = 


F(b) — F(a). Suppose that we want to let the upper limit of integration vary, i.e., we 
replace b by some variable x. We think of a as a fixed starting value xg. In this new 
notation the last equation (after adding F(a) to both sides) becomes: 


Fg) =F (ag) +f f (u) du. 


9 


(Here u is the variable of integration, called a “dummy variable,” since it is not the variable 


in the function F(x). In general, it is not a good idea to use the same letter as a variable 
x 

of integration and as a limit of integration. That is, / f(a)dz is bad notation, and can 
Xo 


lead to errors and confusion.) 

An important application of this principle occurs when we are interested in the position 
of an object at time t (say, on the z-axis) and we know its position at time to. Let s(t) 
denote the position of the object at time t¢ (its distance from a reference point, such as 
the origin on the z-axis). Then the net change in position between to and t is s(t) — s(to). 
Since s(t) is an anti-derivative of the velocity function u(t), we can write 


t 
s(t) = s(to) +f v(u)du. 
to 
Similarly, since the velocity is an anti-derivative of the acceleration function a(t), we have 


u(t) = v(to) +f a(u)du. 


to 
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EXAMPLE 9.2.1 Suppose an object is acted upon by a constant force F’. Find v(t) 
and s(t). By Newton’s law F' = ma, so the acceleration is F'/m, where m is the mass of 
the object. Then we first have 


} F 
=Uo+ —(t — to), 
m 


Be F 
v(t) = v(t) + f —du=v+ —u 
nee m 


to 
using the usual convention vp = u(to). Then 
t 


seis) +f ( - =u 2 to) dese es lu ~ to)?) 


to to 


F 
= 89 + vo(t — to) + at — to)” 


For instance, when F’/m = —g is the constant of gravitational acceleration, then this is 
the falling body formula (if we neglect air resistance) familiar from elementary physics: 


g 
89 + vo(t — to) — a(t — to)?, 


or in the common case that to = 0, 


si UEE = at 


Recall that the integral of the velocity function gives the net distance traveled, that is, 
the displacement. If you want to know the total distance traveled, you must find out where 
the velocity function crosses the t-axis, integrate separately over the time intervals when 
v(t) is positive and when v(t) is negative, and add up the absolute values of the different 
integrals. For example, if an object is thrown straight upward at 19.6 m/sec, its velocity 
function is v(t) = —9.8t + 19.6, using g = 9.8 m/sec? for the force of gravity. This is a 
straight line which is positive for t < 2 and negative for t > 2. The net distance traveled 
in the first 4 seconds is thus 


4 
| (—9.8t + 19.6)dt = 0, 
0 
while the total distance traveled in the first 4 seconds is 


2 4 
i (—9.8¢ + 19.6)dt + | (—9.8¢ + 19.6)dt] = 19.6 + | — 19.6] = 39.2 
0) 2 


meters, 19.6 meters up and 19.6 meters down. 
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EXAMPLE 9.2.2 The acceleration of an object is given by a(t) = cos(at), and its 
velocity at time t = 0 is 1/(27). Find both the net and the total distance traveled in the 
first 1.5 seconds. 

We compute 


= “(5 + sin(xt)). 


The net distance traveled is then 


3/2 1/1 
/ a (5 + sin(rt)) dt 


: ie + 3 ~ 0.340 meters. 


5(3/2) — s(0) 


To find the total distance traveled, we need to know when (0.5 + sin(at)) is positive and 
when it is negative. This function is 0 when sin(zt) is —0.5, i.e., when zt = 77/6, 1177/6, 
etc. The value zt = 77/6, i.e., t = 7/6, is the only value in the range 0 < t < 1.5. Since 
u(t) > 0 for t < 7/6 and v(t) < 0 for t > 7/6, the total distance traveled is 


WOU TL... 2471. . 
/ = (5 + sit) ) dt + LL, = (5 + sin(xt)) at| 


i a | 1 ins: (OF wT 

= 5 (ay + 5 oonTa/6) +=) + =|7 — 35 + 5 eon(7n/6)| 
wie. 1478. 4 13 7 #1¥V3 

= (Rothe 2) ae b+ 298 | ~ ome 


Exercises 9.2. 
For each velocity function find both the net distance and the total distance traveled during the 
indicated time interval (graph v(t) to determine when it’s positive and when it’s negative): 
1 t= cosh); 0 5 25S 
v=—9.8t+49,0<t<10> 
v = 38(t-—3)((-1),,0<t<55> 
v=sin(rt/3) -—t,0<t<1=> 
An object is shot upwards from ground level with an initial velocity of 2 meters per second; 


it is subject only to the force of gravity (no air resistance). Find its maximum altitude and 
the time at which it hits the ground. > 


Se 


6. An object is shot upwards from ground level with an initial velocity of 3 meters per second; 
it is subject only to the force of gravity (no air resistance). Find its maximum altitude and 
the time at which it hits the ground. > 


10. 


11. 
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. An object is shot upwards from ground level with an initial velocity of 100 meters per second; 


it is subject only to the force of gravity (no air resistance). Find its maximum altitude and 
the time at which it hits the ground. > 


. An object moves along a straight line with acceleration given by a(t) = — cos(t), and s(0) = 1 


and v(0) = 0. Find the maximum distance the object travels from zero, and find its maximum 
speed. Describe the motion of the object. > 


. An object moves along a straight line with acceleration given by a(t) = sin(zt). Assume that 


when t = 0, s(t) = v(t) = 0. Find s(t), v(t), and the maximum speed of the object. Describe 
the motion of the object. > 

An object moves along a straight line with acceleration given by a(t) = 1+sin(t). Assume 
that when t = 0, s(t) = v(t) = 0. Find s(t) and v(t). > 

An object moves along a straight line with acceleration given by a(t) = 1 — sin(zt). Assume 
that when t = 0, s(t) = v(t) = 0. Find s(t) and v(t). => 


9.3 VOLUME 


We have seen how to compute certain areas by using integration; some volumes may also 


be computed by evaluating an integral. Generally, the volumes that we can compute this 


way have cross-sections that are easy to describe. 


Figure 9.3.1 Volume of a pyramid approximated by rectangular prisms. (AP) 


EXAMPLE 9.3.1 Find the volume of a pyramid with a square base that is 20 meters 
tall and 20 meters on a side at the base. As with most of our applications of integration, we 


begin by asking how we might approximate the volume. Since we can easily compute the 


volume of a rectangular prism (that is, a “box”), we will use some boxes to approximate 
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the volume of the pyramid, as shown in figure 9.3.1: on the left is a cross-sectional view, on 
the right is a 3D view of part of the pyramid with some of the boxes used to approximate 
the volume. 

Each box has volume of the form (22;)(27;)Ay. Unfortunately, there are two variables 
here; fortunately, we can write x in terms of y: « = 10 — y/2 or x; = 10 —y;/2. Then the 
total volume is approximately 


n-1 


Y_ 4(10 — y5/2)?Ay 


i=0 


and in the limit we get the volume as the value of an integral: 


- a (20-y)3/?? —o8——-208~——-8000 
4(10 - y/2)? dy = f Coe oe 
[ 0 3 0 3 3 3 


As you may know, the volume of a pyramid is (1/3) (height)(area of base) = (1/3)(20)(400), 
which agrees with our answer. 


EXAMPLE 9.3.2 The base of a solid is the region between f(a) = 2? — 1 and g(x) = 
—x? + 1, and its cross-sections perpendicular to the z-axis are equilateral triangles, as 
indicated in figure 9.3.2. The solid has been truncated to show a triangular cross-section 
above x = 1/2. Find the volume of the solid. 


HT 

rH 
HHA 
HIiTtHY 


-1 


Figure 9.3.2 Solid with equilateral triangles as cross-sections. (AP) 


A cross-section at a value x; on the z-axis is a triangle with base 2(1— 2?) and height 


V3(1 — 27), so the area of the cross-section is 


+ (base) (height) = (1 — 27) V3(1 — 23), 
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” is then 


and the volume of a thin “sla 
(1 — #7)V3(1 — «?)Ag. 


Thus the total volume is 


: 16 
/ V3(1 — 27)? de = — V3. 
7 15 


One easy way to get “nice” cross-sections is by rotating a plane figure around a line. 
For example, in figure 9.3.3 we see a plane region under a curve and between two vertical 
lines; then the result of rotating this around the z-axis, and a typical circular cross-section. 


{ | 
a 


Figure 9.3.3 A solid of rotation. (AP) 


Of course a real “slice” of this figure will not have straight sides, but we can approxi- 
mate the volume of the slice by a cylinder or disk with circular top and bottom and straight 
sides; the volume of this disk will have the form mr?7Ax. As long as we can write r in terms 
of x we can compute the volume by an integral. 


EXAMPLE 9.3.3 Find the volume of a right circular cone with base radius 10 and 
height 20. (A right circular cone is one with a circular base and with the tip of the cone 
directly over the center of the base.) We can view this cone as produced by the rotation 
of the line y = x/2 rotated about the x-axis, as indicated in figure 9.3.4. 

At a particular point on the z-axis, say x;, the radius of the resulting cone is the 
y-coordinate of the corresponding point on the line, namely y; = x;/2. Thus the total 


volume is approximately 
n—-1 


n(x;/2)? dx 


and the exact volume is 


4°" "43°83 


[ x 7202 20007 
r—d a 
(0) 
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Figure 9.3.4 A region that generates a cone; approximating the volume by circular disks. 
(AP) 


Note that we can instead do the calculation with a generic height and radius: 


np? 9 rr? he rr7h 
Tse dx = = , 
0 


h? hee 3 


giving us the usual formula for the volume of a cone. 


EXAMPLE 9.3.4 Find the volume of the object generated when the area between 
y =x" and y = z is rotated around the z-axis. This solid has a “hole” in the middle; we 
can compute the volume by subtracting the volume of the hole from the volume enclosed 
by the outer surface of the solid. In figure 9.3.5 we show the region that is rotated, the 
resulting solid with the front half cut away, the cone that forms the outer surface, the 
horn-shaped hole, and a cross-section perpendicular to the x-axis. 

We have already computed the volume of a cone; in this case it is 7/3. At a particular 
value of x, say x;, the cross-section of the horn is a circle with radius rae so the volume of 


1 1 1 
| n(x?)? dz = : nae dx = n=, 
0 0 a) 


so the desired volume is 7/3 — 7/5 = 27/15. 
As with the area between curves, there is an alternate approach that computes the 


the horn is 


desired volume “all at once” by approximating the volume of the actual solid. We can 
approximate the volume of a slice of the solid with a washer-shaped volume, as indicated 
in figure 9.3.5. 

The volume of such a washer is the area of the face times the thickness. The thickness, 
as usual, is Az, while the area of the face is the area of the outer circle minus the area of 
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Figure 9.3.5 Solid with a hole, showing the outer cone and the shape to be removed to 
form the hole. (AP) 


the inner circle, say 7R? — rr. In the present example, at a particular x;, the radius R is 
a; and r is 27. Hence, the whole volume is 


al 3 5 1 
2 4 x x 1 1 21 
_— d = —_—_ — — = --— = —, 
[ Tx dx ac alt "(5 | 5 


Of course, what we have done here is exactly the same calculation as before, except we 


have in effect recomputed the volume of the outer cone. 


Suppose the region between f(a) = x +1 and g(x) = (a — 1)? is rotated around the 
y-axis; see figure 9.3.6. It is possible, but inconvenient, to compute the volume of the 
resulting solid by the method we have used so far. The problem is that there are two 
“kinds” of typical rectangles: those that go from the line to the parabola and those that 
touch the parabola on both ends. To compute the volume using this approach, we need to 
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break the problem into two parts and compute two integrals: 


nf + vi? -(.- vaPay+e f (1+ Vy)? —(y—-1)? dy = St tem = am. 


1 

0 
If instead we consider a typical vertical rectangle, but still rotate around the y-axis, we 
get a thin “shell” instead of a thin “washer”. If we add up the volume of such thin shells 
we will get an approximation to the true volume. What is the volume of such a shell? 
Consider the shell at 7;. Imagine that we cut the shell vertically in one place and “unroll” 
it into a thin, flat sheet. This sheet will be almost a rectangular prism that is Az thick, 
f(a) — g(x;) tall, and 27x; wide (namely, the circumference of the shell before it was 
unrolled). The volume will then be approximately the volume of a rectangular prism with 
these dimensions: 272x;(f(x;) — g(a;))Ax. If we add these up and take the limit as usual, 
we get the integral 


27 


9 di = "9 1 1)*)\de= 
/ mx( f(x) — g(x)) oa | nma(x +1—(2-1)*) OT 


Not only does this accomplish the task with only one integral, the integral is somewhat 
easier than those in the previous calculation. Things are not always so neat, but it is 
often the case that one of the two methods will be simpler than the other, so it is worth 
considering both before starting to do calculations. 


4 44 
34 37 
2 os 
i 1 — A 
0 | 0 | 
0 1 2 3 0 1 2 3 


Figure 9.3.6 Computing volumes with “shells”. (AP) 


EXAMPLE 9.3.5 Suppose the area under y = —x? + 1 between x = 0 and z = 1 is 
rotated around the x-axis. Find the volume by both methods. 


1 
Disk method: : m(1— 2)? dx = —n. 
0 


1 
Shell method: | 2ry,/1 — ydy = at 
0 
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Exercises 9.3. 


1. 


10. 


11. 


12. 


13. 


1 
. Verify that i n(1— 27)? de = —n. 
0 


1 
. Verify that i 2ryVl—ydy= =n. 


1 4 
Verify that 7 (1+ VBP (1 Va ay +m | (1+ Vy)? -(y ly = ant ge ™ 
0 i 


3 
. Verify that [ Qna(az +1 — (ae —1)*) dx = a 


0) 


0) 


. Use integration to find the volume of the solid obtained by revolving the region bounded by 


x+y = 2 and the x and y axes around the z-axis. > 


. Find the volume of the solid obtained by revolving the region bounded by y = x — x” and 


the x-axis around the z-axis. > 


. Find the volume of the solid obtained by revolving the region bounded by y = Vsin x between 


x =0 and x = 7/2, the y-axis, and the line y = 1 around the z-axis. > 


. Let S be the region of the xy-plane bounded above by the curve uy = 64, below by the line 


y = 1, on the left by the line x = 2, and on the right by the line x = 4. Find the volume of 
the solid obtained by rotating S around (a) the z-axis, (b) the line y = 1, (c) the y-axis, (d) 
the line x = 2. > 


. The equation 27/9 + y?/4 = 1 describes an ellipse. Find the volume of the solid obtained 


by rotating the ellipse around the z-axis and also around the y-axis. These solids are called 
ellipsoids; one is vaguely rugby-ball shaped, one is sort of flying-saucer shaped, or perhaps 
squished-beach-ball-shaped. => 


Figure 9.3.7 —_Ellipsoids. 


Use integration to compute the volume of a sphere of radius r. You should of course get the 
well-known formula 47r°/3. 


A hemispheric bowl of radius r contains water to a depth h. Find the volume of water in the 
bowl. => 


The base of a tetrahedron (a triangular pyramid) of height h is an equilateral triangle of side 
s. Its cross-sections perpendicular to an altitude are equilateral triangles. Express its volume 
V as an integral, and find a formula for V in terms of h and s. Verify that your answer is 
(1/3) (area of base) (height). 

The base of a solid is the region between f(x) = cosx and g(x) = —cosa, —7/2< a4 < 1/2, 
and its cross-sections perpendicular to the x-axis are squares. Find the volume of the solid. 
=> 


204 Chapter 9 Applications of Integration 


9.4 AVERAGE VALUE OF A FUNCTION 


The average of some finite set of values is a familiar concept. If, for example, the class 
scores on a quiz are 10, 9, 10, 8, 7, 5, 7, 6, 3, 2, 7, 8, then the average score is the sum of 
these numbers divided by the size of the class: 


10+9+10+8+7+5+7+64+34+2+74+8 82 
12 ae: 


Y 6.83. 


average score = 


Suppose that between t = 0 and t = 1 the speed of an object is sin(at). What is the 
average speed of the object over that time? The question sounds as if it must make sense, 
yet we can’t merely add up some number of speeds and divide, since the speed is changing 
continuously over the time interval. 

To make sense of “average” in this context, we fall back on the idea of approximation. 
Consider the speed of the object at tenth of a second intervals: sin 0, sin(0.17), sin(0.27), 
sin(0.37),..., sin(0.97). The average speed “should” be fairly close to the average of these 
ten speeds: 

LS /10) ~ 46.3 = 0.63 

rp ah i 19 03 = 0.83. 
Of course, if we compute more speeds at more times, the average of these speeds should 
be closer to the “real” average. If we take the average of n speeds at evenly spaced times, 
we get: 


1 n—-1 
— » sin(mi/n). 
ws=0 


Here the individual times are t; = i/n, so rewriting slightly we have 


n-1 

i 

e > sin(mt;). 
1=0 


This is almost the sort of sum that we know turns into an integral; what’s apparently 
missing is At—but in fact, At = 1/n, the length of each subinterval. So rewriting again: 


n-1 n-1 

il 
y sin(at;)— = y sin(at;) At. 
i=0 nN 40 


Now this has exactly the right form, so that in the limit we get 


cos(rt) |" 
T 


9 
_ _ cost) , cost) _ 2 _ 6366 = 0.64. 
0 Ms - . 


1 
average speed = | sin(at) dt = 
0 


It’s not entirely obvious from this one simple example how to compute such an average 
in general. Let’s look at a somewhat more complicated case. Suppose that the velocity 
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of an object is 16t? + 5 feet per second. What is the average velocity between t = 1 and 
t = 3? Again we set up an approximation to the average: 


1 n—-1 
— \ © 16t? +5, 
We i=0 


where the values t; are evenly spaced times between 1 and 3. Once again we are “missing” 
At, and this time 1/n is not the correct value. What is At in general? It is the length of 
a subinterval; in this case we take the interval [1,3] and divide it into n subintervals, so 
each has length (3 — 1)/n = 2/n = At. Now with the usual “multiply and divide by the 


same thing” trick we can rewrite the sum: 


n n-1 _ _— 
it 1 i _ 1% 
= 16t? +5 = —— 16t? as == ‘1682 5) = ‘(1682 5)A 


In the limit this becomes 


1 1446 _ 223 
— | 16t?+5dt= cui 
2 [ = -_—a 3 


Does this seem reasonable? Let’s picture it: in figure 9.4.1 is the velocity function together 
with the horizontal line y = 223/3 = 74.3. Certainly the height of the horizontal line looks 
at least plausible for the average height of the curve. 


150 = 
125 
100 = 


79 4 


50 


0 
0 1 2 3 


Figure 9.4.1 Average velocity. 


Here’s another way to interpret “average” that may make our computation appear 
even more reasonable. The object of our example goes a certain distance between t = 1 
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and t = 3. If instead the object were to travel at the average speed over the same time, it 
should go the same distance. At an average speed of 223/3 feet per second for two seconds 
the object would go 446/3 feet. How far does it actually go? We know how to compute 


this: 
3 
44 
[roa f ie Leaes =. 
1 a fo 


So now we see that another interpretation of the calculation 


1 1446 _ 223 
— | 16¢7+5dt= ae 
2 [ = 23 3 


is: total distance traveled divided by the time in transit, namely, the usual interpretation 
of average speed. 

In the case of speed, or more properly velocity, we can always interpret “average” as 
total (net) distance divided by time. But in the case of a different sort of quantity this 
interpretation does not obviously apply, while the approximation approach always does. 
We might interpret the same problem geometrically: what is the average height of 16x? +5 
on the interval [1,3]? We approximate this in exactly the same way, by adding up many 
sample heights and dividing by the number of samples. In the limit we get the same result: 


1 1446 223 
Jin, 2 1088 reah f 16x? a a eee 


We can interpret this result in a slightly different way. The area under y = 16x? +5 above 
[1,3] is 
3 
446 
| 16¢7 + 5dt = —. 
1 

The area under y = 223/3 over the same interval [1,3] is simply the area of a rectangle 
that is 2 by 223/3 with area 446/3. So the average height of a function is the height of the 
horizontal line that produces the same area over the given interval. 


Exercises 9.4. 


. Find the average height of cos x over the intervals [0, 7/2], [—7/2,7/2], and [0,27]. => 
. Find the average height of x” over the interval [—2, 2]. > 

. Find the average height of 1/x” over the interval [1, A]. > 

. Find the average height of J1— 2? over the interval [—1, 1]. > 


. An object moves with velocity u(t) = —t? +1 feet per second between t = 0 and t = 2. Find 
the average velocity and the average speed of the object between t = 0 andt = 2. > 


a -. WN 
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6. The observation deck on the 102nd floor of the Empire State Building is 1,224 feet above 
the ground. If a steel ball is dropped from the observation deck its velocity at time t is 
approximately u(t) = —32t feet per second. Find the average speed between the time it is 
dropped and the time it hits the ground, and find its speed when it hits the ground. > 


9.5 WoRK 


A fundamental concept in classical physics is work: If an object is moved in a straight 
line against a force F' for a distance s the work done is W = F's. 


EXAMPLE 9.5.1 How much work is done in lifting a 10 pound weight vertically a 
distance of 5 feet? The force due to gravity on a 10 pound weight is 10 pounds at the 
surface of the earth, and it does not change appreciably over 5 feet. The work done is 
W = 10-5 = 50 foot-pounds. 


In reality few situations are so simple. The force might not be constant over the range 


of motion, as in the next example. 


EXAMPLE 9.5.2 How much work is done in lifting a 10 pound weight from the surface 
of the earth to an orbit 100 miles above the surface? Over 100 miles the force due to gravity 
does change significantly, so we need to take this into account. The force exerted on a 10 
pound weight at a distance r from the center of the earth is F = k/r? and by definition 
it is 10 when r is the radius of the earth (we assume the earth is a sphere). How can we 
approximate the work done? We divide the path from the surface to orbit into n small 
subpaths. On each subpath the force due to gravity is roughly constant, with value k/r? 
at distance r;. The work to raise the object from r; to r;,1 is thus approximately k/r?Ar 
and the total work is approximately 


or in the limit 


w= oe 
TO r 


where ro is the radius of the earth and r; is ro plus 100 miles. The work is 


w= Sia 
ae: a 


Using ro = 20925525 feet we have r; = 21453525. The force on the 10 pound weight at 
the surface of the earth is 10 pounds, so 10 = k/20925525”, giving k = 4378775965256250. 


™ ok ok 


TL To 


1 
TO 
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Then 


k  k — 491052320000 
= = ~ 5150052 foot-pounds. 
ri’ To 95349 ne 


Note that if we assume the force due to gravity is 10 pounds over the whole distance we 
would calculate the work as 10(r; — 79) = 10- 100-5280 = 5280000, somewhat higher since 
we don’t account for the weakening of the gravitational force. 


EXAMPLE 9.5.3 How much work is done in lifting a 10 kilogram object from the 
surface of the earth to a distance D from the center of the earth? This is the same 
problem as before in different units, and we are not specifying a value for D. As before 


D 
es oe 
ro T r 


While “weight in pounds” is a measure of force, “weight in kilograms” is a measure of mass. 


D 


TO 


To convert to force we need to use Newton’s law F' = ma. At the surface of the earth the 
acceleration due to gravity is approximately 9.8 meters per second squared, so the force is 
F =10-9.8 = 98. The units here are “kilogram-meters per second squared” or “kg m/s”, 
also known as a Newton (N), so F' = 98 N. The radius of the earth is approximately 6378.1 
kilometers or 6378100 meters. Now the problem proceeds as before. From F = k/r? we 


compute k: 98 = k/63781007, k = 3.986655642 - 10'°. Then the work is: 


k; 
W = —= + 6.250538000 - 10° Newton-meters. 
As D increases W of course gets larger, since the quantity being subtracted, —k/D, gets 
smaller. But note that the work W will never exceed 6.250538000 - 10°, and in fact will 
approach this value as D gets larger. In short, with a finite amount of work, namely 
6.250538000 - 10° N-m, we can lift the 10 kilogram object as far as we wish from earth. 


Next is an example in which the force is constant, but there are many objects moving 
different distances. 


EXAMPLE 9.5.4 Suppose that a water tank is shaped like a right circular cone with 
the tip at the bottom, and has height 10 meters and radius 2 meters at the top. If the 
tank is full, how much work is required to pump all the water out over the top? Here we 
have a large number of atoms of water that must be lifted different distances to get to the 
top of the tank. Fortunately, we don’t really have to deal with individual atoms—we can 
consider all the atoms at a given depth together. 
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Figure 9.5.1 Cross-section of a conical water tank. 


To approximate the work, we can divide the water in the tank into horizontal sections, 
approximate the volume of water in a section by a thin disk, and compute the amount of 
work required to lift each disk to the top of the tank. As usual, we take the limit as the 
sections get thinner and thinner to get the total work. 

At depth h the circular cross-section through the tank has radius r = (10 — h)/5, by 
similar triangles, and area 7(10—h)*/25. A section of the tank at depth h thus has volume 
approximately (10 — h)?/25Ah and so contains o7(10 — h)?/25Ah kilograms of water, 
where o is the density of water in kilograms per cubic meter; 0 = 1000. The force due to 
gravity on this much water is 9.807(10— h)?/25Ah, and finally, this section of water must 
be lifted a distance h, which requires h9.807(10 — h)?/25Ah Newton-meters of work. The 
total work is therefore 


9.8 gu 980000 
vee if h(10 — h)? dh = ——m = 1026254 Newton-meters. 
0 


A spring has a “natural length,” its length if nothing is stretching or compressing 
it. If the spring is either stretched or compressed the spring provides an opposing force; 
according to Hooke’s Law the magnitude of this force is proportional to the distance the 
spring has been stretched or compressed: F' = kx. The constant of proportionality, k, of 
course depends on the spring. Note that x here represents the change in length from the 
natural length. 


EXAMPLE 9.5.5 Suppose k = 5 for a given spring that has a natural length of 0.1 
meters. Suppose a force is applied that compresses the spring to length 0.08. What is 
the magnitude of the force? Assuming that the constant k has appropriate dimensions 
(namely, kg/s?), the force is 5(0.1 — 0.08) = 5(0.02) = 0.1 Newtons. 
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EXAMPLE 9.5.6 How much work is done in compressing the spring in the previous 
example from its natural length to 0.08 meters? From 0.08 meters to 0.05 meters? How 
much work is done to stretch the spring from 0.1 meters to 0.15 meters? We can approx- 
imate the work by dividing the distance that the spring is compressed (or stretched) into 
small subintervals. Then the force exerted by the spring is approximately constant over the 
subinterval, so the work required to compress the spring from x; to x;41 is approximately 
5(x; — 0.1)Axv. The total work is approximately 


n-1 
5(a; = 0.1)Ax 
4=0 
and in the limit 
0.08 g |0.08 2 2 
5(a — 0.1) 5(0.08 — 0.1) 5(0.1 — 0.1) 1 
= 5(a —0.1)dx = ————+~| = = 7 N-m. 
aa eee 2 0.1 2 2 1000. 


The other values we seek simply use different limits. To compress the spring from 0.08 
meters to 0.05 meters takes 


0.05 9 |0.05 
5(x — 0.1 

W = CN ee comtis 

0.08 2 


= 2 —_ 2 
5(0.05 0.1)? _5(0.08—-0.1)? _ 21 | 
2 2 4000 


0.08 
and to stretch the spring from 0.1 meters to 0.15 meters requires 


0.15 9 0-15 
5(x — 0.1 

W = SR Ns cae 

0.1 2 


_ 2 = 2 
_ 5(0.15-0.1)? 5001-017 1 y 
2 2 160 


0.1 


Exercises 9.5. 


1. How much work is done in lifting a 100 kilogram weight from the surface of the earth to an 
orbit 35,786 kilometers above the surface of the earth? => 


2. How much work is done in lifting a 100 kilogram weight from an orbit 1000 kilometers above 
the surface of the earth to an orbit 35,786 kilometers above the surface of the earth? = 


3. A water tank has the shape of an upright cylinder with radius r = 1 meter and height 10 
meters. If the depth of the water is 5 meters, how much work is required to pump all the 
water out the top of the tank? > 


4. Suppose the tank of the previous problem is lying on its side, so that the circular ends are 
vertical, and that it has the same amount of water as before. How much work is required 
to pump the water out the top of the tank (which is now 2 meters above the bottom of the 
tank)? => 
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5. A water tank has the shape of the bottom half of a sphere with radius r = 1 meter. If the 
tank is full, how much work is required to pump all the water out the top of the tank? => 


6. A spring has constant k = 10 kg/ s”. How much work is done in compressing it 1 /10 meter 
from its natural length? > 


7. A force of 2 Newtons will compress a spring from 1 meter (its natural length) to 0.8 meters. 
How much work is required to stretch the spring from 1.1 meters to 1.5 meters? => 


8. A 20 meter long steel cable has density 2 kilograms per meter, and is hanging straight down. 
How much work is required to lift the entire cable to the height of its top end? => 


9. The cable in the previous problem has a 100 kilogram bucket of concrete attached to its lower 
end. How much work is required to lift the entire cable and bucket to the height of its top 
end? => 


10. Consider again the cable and bucket of the previous problem. How much work is required 
to lift the bucket 10 meters by raising the cable 10 meters? (The top half of the cable ends 
up at the height of the top end of the cable, while the bottom half of the cable is lifted 10 
meters.) => 
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Suppose a beam is 10 meters long, and that there are three weights on the beam: a 10 
kilogram weight 3 meters from the left end, a 5 kilogram weight 6 meters from the left end, 
and a 4 kilogram weight 8 meters from the left end. Where should a fulcrum be placed 
so that the beam balances? Let’s assign a scale to the beam, from 0 at the left end to 10 
at the right, so that we can denote locations on the beam simply as x coordinates; the 
weights are at x = 3, x = 6, and x = 8, as in figure 9.6.1. 


5s & —€ °&8 


Figure 9.6.1 A beam with three masses. 


Suppose to begin with that the fulcrum is placed at x = 5. What will happen? Each 
weight applies a force to the beam that tends to rotate it around the fulcrum; this effect 
is measured by a quantity called torque, proportional to the mass times the distance 
from the fulcrum. Of course, weights on different sides of the fulcrum rotate the beam 
in opposite directions. We can distinguish this by using a signed distance in the formula 
for torque. So with the fulcrum at 5, the torques induced by the three weights will be 
proportional to (3 — 5)10 = —20, (6 — 5)5 = 5, and (8 — 5)4 = 12. For the beam to 
balance, the sum of the torques must be zero; since the sum is —20 + 5+ 12 = —3, the 
beam rotates counter-clockwise, and to get the beam to balance we need to move the 
fulcrum to the left. To calculate exactly where the fulcrum should be, we let x denote the 
location of the fulcrum when the beam is in balance. The total torque on the beam is then 
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(3 —£)10+ (6 —%)5+(8—£)4 = 92 — 19%. Since the beam balances at X it must be that 
92 — 19% = 0 or & = 92/19 & 4.84, that is, the fulcrum should be placed at x = 92/19 to 
balance the beam. 

Now suppose that we have a beam with varying density—some portions of the beam 
contain more mass than other portions of the same size. We want to figure out where to 
put the fulcrum so that the beam balances. 


Figure 9.6.2 A solid beam. 


EXAMPLE 9.6.1 Suppose the beam is 10 meters long and that the density is 1+ a 
kilograms per meter at location x on the beam. To approximate the solution, we can think 
of the beam as a sequence of weights “on” a beam. For example, we can think of the 
portion of the beam between x = 0 and x = 1 as a weight sitting at x = 0, the portion 
between x = 1 and x = 2 asa weight sitting at x = 1, and so on, as indicated in figure 9.6.2. 
We then approximate the mass of the weights by assuming that each portion of the beam 
has constant density. So the mass of the first weight is approximately mp = (1+0)1=1 
kilograms, namely, (1 + 0) kilograms per meter times 1 meter. The second weight is 
m, = (1+ 1)1 = 2 kilograms, and so on to the tenth weight with mg = (1+ 9)1 = 10 
kilograms. So in this case the total torque is 


(0 —Z)mp + (1—Z)m, +---+ (9 —£)mg = (0-—£)1 4+ (1-—%)24+---+(9—%)10. 


If we set this to zero and solve for Z we get x = 6. In general, if we divide the beam into 
n portions, the mass of weight number i will be m; = (14+ %;)(%iq1 — vi) = (14+ 2;)Az 
and the torque induced by weight number i will be (x; — %)m; = (a; — Z)(1+ 2;)Az. The 
total torque is then 


(to — Z)(1+a20)Az + (41 —£)(14+41)AG+---4+ (tn-1 — %)(1 + 27-1) Az 


n—-1 n—-1 

= a gj(1+2;)Ax — > Z(1+a2;)Ar 
1=0 1=0 
n—1 n—-1 
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If we set this equal to zero and solve for Z we get an approximation to the balance point 
of the beam: 


n-1 n—-1 
0= ) 2,(1+2,)Ac—2 So (1+ a)Az 
1=0 1=0 


ZY (14+2,;)Azr = So ai(l+ xi) Az 
i=0 
n-1 
S- u(1+2;)Ar 
za 0 
n—-1 
So eye 
i=0 


The denominator of this fraction has a very familiar interpretation. Consider one term of 
the sum in the denominator: (1+ 2;)Az. This is the density near x; times a short length, 
Az, which in other words is approximately the mass of the beam between x; and 2;41. 
When we add these up we get approximately the mass of the beam. 

Now each of the sums in the fraction has the right form to turn into an integral, which 


[0 +2) dx 
[rae | 


The numerator of this fraction is called the moment of the system around zero: 


10 10 

1150 

i (+2) de = | g+a7 dz = —, 
0 0 3 


and the denominator is the mass of the beam: 


in turn gives us the exact value of Zz: 


C= 


10 
| (1+ 2) dx = 60, 
0 


and the balance point, officially called the center of mass, is 


11 
a 50 1 — 5 6.39. 
3 60 18 


8 
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It should be apparent that there was nothing special about the density function o(#) = 
1+ <2 or the length of the beam, or even that the left end of the beam is at the origin. 
In general, if the density of the beam is a(x) and the beam covers the interval [a,b], the 


moment of the beam around zero is 


b 
My = | xo (x) dx 


and the total mass of the beam is 


m= [ox 


and the center of mass is at 

_ Mo 

Daag 
EXAMPLE 9.6.2 Suppose a beam lies on the x-axis between 20 and 30, and has density 
function o(x) = x—19. Find the center of mass. This is the same as the previous example 
except that the beam has been moved. Note that the density at the left end is 20—19 = 1 


and at the right end is 30 — 19 = 11, as before. Hence the center of mass must be at 
approximately 20 + 6.39 = 26.39. Let’s see how the calculation works out. 


30 30 3 9 (30 
19 4750 

20 20 3 2 Jao 3 

30 “2 30 
M= x—19dx = ——19x]| =60 
20 2 90 
M 4 1 4 
o ee ® 26.39. 


M3 60 18 


EXAMPLE 9.6.3 Suppose a flat plate of uniform density has the shape contained by 
y = 27, y=1, and x = 0, in the first quadrant. Find the center of mass. (Since the density 
is constant, the center of mass depends only on the shape of the plate, not the density, or 
in other words, this is a purely geometric quantity. In such a case the center of mass is 
called the centroid.) 

This is a two dimensional problem, but it can be solved as if it were two one dimensional 
problems: we need to find the x and y coordinates of the center of mass, % and y, and 
fortunately we can do these independently. Imagine looking at the plate edge on, from 
below the x-axis. The plate will appear to be a beam, and the mass of a short section 
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M4 


0 x4 1 
Figure 9.6.3 Center of mass for a two dimensional plate. 


of the “beam”, say between x; and 2;41, is the mass of a strip of the plate between 2; 
and 2441. See figure 9.6.3 showing the plate from above and as it appears edge on. Since 
the plate has uniform density we may as well assume that o = 1. Then the mass of the 
plate between 2; and 2;+, is approximately m; = o(1—2?)Agx = (1—2?)Azx. Now we can 
compute the moment around the y-axis: 


: i! 
M,= [ 20 -29)de=3 
0 4 


and the total mass 


: 2 
M= f Q-e)dr=5 
0 3 
and finally 
a. oer 
ae ae 


Next we do the same thing to find y. The mass of the plate between y; and yj+1 is 
approximately n; = /yAy, so 


and 


since the total mass M is the same. The center of mass is shown in figure 9.6.3. 


EXAMPLE 9.6.4 Find the center of mass of a thin, uniform plate whose shape is the 
region between y = cosx and the z-axis between x = —7/2 and x = 7/2. It is clear 
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that z = 0, but for practice let’s compute it anyway. We will need the total mass, so we 


compute it first: 


m/2 wm /2 
M= cosxdx = sinx = 
—n/2 —1/2 
The moment around the y-axis is 
m/2 mr /2 
M, = xcosxdx =cosx+axsinz =0 
—1/2 —n/2 


and the moment around the z-axis is 


Thus 


' . 
1—y2 
M, = [ y - 2arccos y dy = y? arccosy — % Y a ag af 
0 9 5 r 
) 
0 
one. gj = ~ ~ 0.393 

2 8 


Exercises 9.6. 


1. 


12. 


13. 


. Verify that f 2x arccos x dx = x” arccos x 


A beam 10 meters long has density o(a) = x” at distance x from the left end of the beam. 
Find the center of mass Z. => 


A beam 10 meters long has density o(x) = sin(7a/10) at distance x from the left end of the 
beam. Find the center of mass %. => 


A beam 4 meters long has density o(a) = «° at distance « from the left end of the beam. 


Find the center of mass Z. > 
aVl—«x? ~~ arcsine 
2 + 2 


A thin plate lies in the region between y = x” and the x-axis between « = 1 and x = 2. Find 
the centroid. => 


A thin plate fills the upper half of the unit circle 2” + y” = 1. Find the centroid. > 


+C. 


A thin plate lies in the region contained by y = x and y = x”. Find the centroid. > 


A thin plate lies in the region contained by y = 4—” and the x-axis. Find the centroid. > 


/ 


A thin plate lies in the region contained by y = x'/* and the x-axis between x = 0 and x = 1. 


Find the centroid. > 
A thin plate lies in the region contained by /x + \/y = 1 and the axes in the first quadrant. 
Find the centroid. > 


. A thin plate lies in the region between the circle x” + y” = 4 and the circle x” + y”? = 1, 


above the z-axis. Find the centroid. > 


A thin plate lies in the region between the circle x” + y” = 4 and the circle x7 + y” = 1 in 
the first quadrant. Find the centroid. > 


A thin plate lies in the region between the circle x” + y” = 25 and the circle 2” + y” = 16 
above the z-axis. Find the centroid. => 
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9.7 KINETIC ENERGY; IMPROPER INTEGRALS 


Recall example 9.5.3 in which we computed the work required to lift an object from the 
surface of the earth to some large distance D away. Since F = k/x? we computed 


We noticed that as D increases, k/D decreases to zero so that the amount of work increases 
to k/ro. More precisely, 
D 
k kok k 
lim = 00 = lim = + = —. 
He 


D->0o ro D->0o TO TO 


We might reasonably describe this calculation as computing the amount of work required 
to lift the object “to infinity,” and abbreviate the limit as 


Such an integral, with a limit of infinity, is called an improper integral. This is a bit 
unfortunate, since it’s not really “improper” to do this, nor is it really “an integral” —it is 
an abbreviation for the limit of a particular sort of integral. Nevertheless, we’re stuck with 
the term, and the operation itself is perfectly legitimate. It may at first seem odd that a 
finite amount of work is sufficient to lift an object to “infinity”, but sometimes surprising 
things are nevertheless true, and this is such a case. If the value of an improper integral 
is a finite number, as in this example, we say that the integral converges, and if not we 
say that the integral diverges. 

Here’s another way, perhaps even more surprising, to interpret this calculation. We 
know that one interpretation of 


is the area under y = 1/a? from 2 = 1 to x = D. Of course, as D increases this area 


[ se _1,1 
( a —— D iS 


while the area increases, it never exceeds 1, that is 


case | 
1 


The area of the infinite region under y = 1/a? from x = 1 to infinity is finite. 


increases. But since 
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Consider a slightly different sort of improper integral: / 


ae 
ze” dx. There are two 


ways we might try to compute this. First, we could break it up into two more familiar 


integrals: 


eo 2 0 2 ad 2 
: re” ax = | xe” a+ [ ze” dx. 
—oo —oo (0) 


Now we do these as before: 


ee Le 
and 
CO —x? Pp 
i ze * dx= lim — = 
f sae 
0 
so 
= 2 1 
il ze * dr =>—--~+—-=0 
Alternately, we might try 
D 


2 


of 2 p 2 e. 
ze” dx = lim ve” dx= lim — 
D->oo _pD D->oo 


— Co 


fess 


—D? —D? 


2 2 


So we get the same answer either way. This does not always happen; sometimes the second 


approach gives a finite number, while the first approach does not; the exercises provide 


examples. In general, we interpret the integral / f (x) dx according to the first method: 


both integrals / f(x) dx and / f(x) dx must converge for the original integral to 


D 


converge. The second approach does turn out to be useful; when lim pada = Js 


D-> oo —D 


and L is finite, then L is called the Cauchy Principal Value of | Pa \aas 


Here’s a more concrete application of these ideas. We know that in general 


v1 
w= | F dz 
Xo 


is the work done against the force F in moving from xp to x;. In the case that F' is the 


force of gravity exerted by the earth, it is customary to make F’ < 0 since the force is 
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“downward.” This makes the work W negative when it should be positive, so typically the 


Ly 
w=- | Fdz. 
xO 


Also, by Newton’s Law, F' = ma(t). This means that 


= -{ ma(t) dx 
xO 


Unfortunately this integral is a bit problematic: a(t) is in terms of t, while the limits and 


work in this case is defined as 


the “dx” are in terms of x. But x and t are certainly related here: x = x(t) is the function 
that gives the position of the object at time t, so v = v(t) = dx/dt = x'(t) is its velocity 
and a(t) = v(t) = 2’"(t). We can use v = a(t) as a substitution to convert the integral 
from “da” to “dv” in the usual way, with a bit of cleverness along the way: 


dt 
dv = x" (t) dt = a(t) dt = a(t) — dx 
dx 
dx 
a dv = a(t) dx 
udu = a(t) daz. 
Substituting in the integral: 
re mv? |" mv? mve 
=--[- ma(t SE acer a ne 


You may recall seeing the expression mv7/2 in a physics course—it is called the kinetic 
energy of the object. We have shown here that the work done in moving the object from 
one place to another is the same as the change in kinetic energy. 

We know that the work required to move an object from the surface of the earth to 


W= cs ir =. 


At the surface of the earth the acceleration due to gravity is approximately 9.8 meters 


infinity is 


per second squared, so the force on an object of mass m is F' = 9.8m. The radius of the 
earth is approximately 6378.1 kilometers or 6378100 meters. Since the force due to gravity 
obeys an inverse square law, F = k/r? and 9.8m = k/63781007, k = 398665564178000m 
and W = 62505380m. 
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Now suppose that the initial velocity of the object, vo, is just enough to get it to 
infinity, that is, just enough so that the object never slows to a stop, but so that its speed 
decreases to zero, i.e., so that v; = 0. Then 
mop mug — mv 

2 2 9 


62505380m = W = — 


sXe) 
vg = V 125010760 + 11181 meters per second, 


or about 40251 kilometers per hour. This speed is called the escape velocity. Notice that 
the mass of the object, m, canceled out at the last step; the escape velocity is the same 
for all objects. Of course, it takes considerably more energy to get a large object up to 
40251 kph than a small one, so it is certainly more difficult to get a large object into deep 
space than a small one. Also, note that while we have computed the escape velocity for 
the earth, this speed would not in fact get an object “to infinity” because of the large mass 
in our neighborhood called the sun. Escape velocity for the sun starting at the distance of 
the earth from the sun is nearly 4 times the escape velocity we have calculated. 


Exercises 9.7. 


1. Is the area under y = 1/x from 1 to infinity finite or infinite? If finite, compute the area. > 


2. Is the area under y = 1/ x from 1 to infinity finite or infinite? If finite, compute the area. 
=> 


3. Does | x” + 2a —1dx converge or diverge? If it converges, find the value. > 
0 

4. Does / 1/Vx dx converge or diverge? If it converges, find the value. > 
1 


5. Does [ e ° dx converge or diverge? If it converges, find the value. > 
0 


1/2 
6. (2x — i" dx is an improper integral of a slightly different sort. Express it as a limit 


0 
and determine whether it converges or diverges; if it converges, find the value. > 


1 
7. Does [ 1/\/x dx converge or diverge? If it converges, find the value. > 
0 


nm /2 
8. Does [ sec” x dx converge or diverge? If it converges, find the value. > 
0 


co 2 
9. Does j ra 5 dx converge or diverge? If it converges, find the value. = 
= x 


10. Does / x dx converge or diverge? If it converges, find the value. Also find the Cauchy 


Principal Value, if it exists. > 


co 


11. Does sin x dx converge or diverge? If it converges, find the value. Also find the Cauchy 


Principal Value, if it exists. > 
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co 


12. Does cos x dx converge or diverge? If it converges, find the value. Also find the Cauchy 


Principal Value, if it exists. > 


13. Suppose the curve y = 1/z is rotated around the z-axis generating a sort of funnel or horn 
shape, called Gabriel’s horn or Toricelli’s trumpet. Is the volume of this funnel from 
x = 1 to infinity finite or infinite? If finite, compute the volume. > 


14. An officially sanctioned baseball must be between 142 and 149 grams. How much work, 
in Newton-meters, does it take to throw a ball at 80 miles per hour? At 90 mph? At 
100.9 mph? (According to the Guinness Book of World Records, at http://www.baseball- 
almanac.com/recbooks/rb_guin.shtml, “The greatest reliably recorded speed at which a 
baseball has been pitched is 100.9 mph by Lynn Nolan Ryan (California Angels) at Anaheim 
Stadium in California on August 20, 1974.”) => 


9.8 PROBABILITY 


You perhaps have at least a rudimentary understanding of discrete probability, which 
measures the likelihood of an “event” when there are a finite number of possibilities. For 
example, when an ordinary six-sided die is rolled, the probability of getting any particular 
number is 1/6. In general, the probability of an event is the number of ways the event can 
happen divided by the number of ways that “anything” can happen. 

For a slightly more complicated example, consider the case of two six-sided dice. The 
dice are physically distinct, which means that rolling a 2—5 is different than rolling a 5-2; 
each is an equally likely event out of a total of 836 ways the dice can land, so each has a 
probability of 1/36. 

Most interesting events are not so simple. More interesting is the probability of rolling 
a certain sum out of the possibilities 2 through 12. It is clearly not true that all sums are 
equally likely: the only way to roll a 2 is to roll 1-1, while there are many ways to roll a 7. 
Because the number of possibilities is quite small, and because a pattern quickly becomes 
evident, it is easy to see that the probabilities of the various sums are: 


P(2) = P(12) = 1/36 
P(3) = P(11) = 2/36 
P(4) = P(10) = 3/36 
P(5) = P(9) = 4/36 
P(6) = P(8) = 5/36 
P(7) = 6/36 


Here we use P(n) to mean “the probability of rolling an n.” Since we have correctly 
accounted for all possibilities, the sum of all these probabilities is 36/36 = 1; the probability 
that the sum is one of 2 through 12 is 1, because there are no other possibilities. 
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The study of probability is concerned with more difficult questions as well; for example, 
suppose the two dice are rolled many times. On the average, what sum will come up? In 
the language of probability, this average is called the expected value of the sum. This is 
at first a little misleading, as it does not tell us what to “expect” when the two dice are 
rolled, but what we expect the long term average will be. 

Suppose that two dice are rolled 36 million times. Based on the probabilities, we would 
expect about 1 million rolls to be 2, about 2 million to be 3, and so on, with a roll of 7 
topping the list at about 6 million. The sum of all rolls would be 1 million times 2 plus 2 
million times 3, and so on, and dividing by 36 million we would get the average: 


= 6 6 6 6 1 
B= (2-10° +3(2- 10°) +---+ 7(6- 10°) +---+12-10°) 
10° 2-108 6 - 10° 10° 
= 236-106 1 336-100 1 + "36-496 T° + 1436- G08 
= 2P(2) +3P(3)+---+ 7P(7) +--- + 12P(12) 

12 
= Pa) = s 
41=2 


There is nothing special about the 36 million in this calculation. No matter what the 
12 


number of rolls, once we simplify the average, we get the same x iP(i). While the actual 
i=2 
average value of a large number of rolls will not be exactly 7, the average should be close 
to 7 when the number of rolls is large. Turning this around, if the average is not close to 
7, we should suspect that the dice are not fair. 
A variable, say X, that can take certain values, each with a corresponding probability, 
is called a random variable; in the example above, the random variable was the sum of 


the two dice. If the possible values for X are x71, %2,...,0%n, then the expected value of the 


random variable is E(X) = ye x;P(a;). The expected value is also called the mean. 
i=l 

When the number of possible values for X is finite, we say that X is a discrete random 
variable. In many applications of probability, the number of possible values of a random 
variable is very large, perhaps even infinite. To deal with the infinite case we need a 
different approach, and since there is a sum involved, it should not be wholly surprising 
that integration turns out to be a useful tool. It then turns out that even when the 
number of possibilities is large but finite, it is frequently easier to pretend that the number 
is infinite. Suppose, for example, that a dart is thrown at a dart board. Since the dart 
board consists of a finite number of atoms, there are in some sense only a finite number 
of places for the dart to land, but it is easier to explore the probabilities involved by 
pretending that the dart can land on any point in the usual x-y plane. 
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DEFINITION 9.8.1 Let f : R > R be a function. If f(x) > 0 for every x and 


/ f(x) dx = 1 then f is a probability density function. 


We associate a probability density function with a random variable X by stipulating 


b 
that the probability that X is between a and 6 is / f(x) dx. Because of the requirement 


that the integral from —oo to oo be 1, all probabilities are less than or equal to 1, and the 
probability that X takes on some value between —oo and oo is 1, as it should be. 


EXAMPLE 9.8.2 Consider again the two dice example; we can view it in a way that 
more resembles the probability density function approach. Consider a random variable X 
that takes on any real value with probabilities given by the probability density function in 
figure 9.8.1. The function f consists of just the top edges of the rectangles, with vertical 
sides drawn for clarity; the function is zero below 1.5 and above 12.5. The area of each 
rectangle is the probability of rolling the sum in the middle of the bottom of the rectangle, 


n+1/2 
Pir) = / t (a) dx. 


1/2 


or 


The probability of rolling a 4, 5, or 6 is 


13/2 
P(n) = | i, [eae 


Of course, we could also compute probabilities that don’t make sense in the context of the 
dice, such as the probability that X is between 4 and 5.8. 


6/36 4 


5/36 4 


4/36 4 


3/36 4 


2/36 4 


1/36 4 


1 2 3 4 5 6 7 8 9 10 361112 


Figure 9.8.1 A probability density function for two dice. 
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The function 
Fa) =P(X% <a2)= ik f (t)dt 


is called the cumulative distribution function or simply (probability) distribution. 


EXAMPLE 9.8.3 Suppose that a < 6 and 


He)={ =a ifa<a<b 


0 otherwise. 


Then f(x) is the uniform probability density function on [a, b]. and the corresponding 


distribution is the uniform distribution on |a, }]. 


EXAMPLE 9.8.4 Consider the function f(z) = e~* /2. What can we say about 


) et /2 dx? 


We cannot find an antiderivative of f, but we can see that this integral is some finite 


number. Notice that 0 < f(x) = en l2 < e- 2/2 for |x| > 1. This implies that the area 


—x? /2 x/2 


under e is less than the area under e */“, over the interval [l,oo). It is easy to 


CO y) 
—«/2 g _ 
e€ c=, 
: ve 


| et /2 dx 
1 


is some finite number smaller than 2/,/e. Because f is symmetric around the y-axis, 


-—1 lee) 
/ et /2 dz = i; et /2 da. 
—oo 1 
This means that 


- 1 1 0 
/ en? /2 dy = / en 8/2 dy 4 : en? /2 dy + / el dz = A 
a ata ¥ 1 


for some finite positive number A. Now if we let g(x) = f(x)/A, 


CO 1 CO 2 1 
/ g(x) dx af da. 44 ule 


—Co 


compute the latter area, namely 


SO 


so g is a probability density function. It turns out to be very useful, and is called the 
standard normal probability density function or more informally the bell curve, 
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0.5 5 


Figure 9.8.2 The bell curve. 


giving rise to the standard normal distribution. See figure 9.8.2 for the graph of the 


bell curve. 


We have shown that A is some finite number without computing it; we cannot compute 
it with the techniques we have available. By using some techniques from multivariable 
calculus, it can be shown that A = V2r. 


EXAMPLE 9.8.5 The exponential distribution has probability density function 


0 xa<0 
ce" a«>0 


where c is a positive constant. 


The mean or expected value of a random variable is quite useful, as hinted at in 
our discussion of dice. Recall that the mean for a discrete random variable is E(X) = 


n 
Se x;P(x;). In the more general context we use an integral in place of the sum. 
i=1 


DEFINITION 9.8.6 The mean of a random variable X with probability density func- 


CO 


tion f is p= E(X) = i x f(x) dx, provided the integral converges. 
—oo 

When the mean exists it is unique, since it is the result of an explicit calculation. The 
mean does not always exist. 

The mean might look familiar; it is essentially identical to the center of mass of a one- 
dimensional beam, as discussed in section 9.6. The probability density function f plays 
the role of the physical density function, but now the “beam” has infinite length. If we 
consider only a finite portion of the beam, say between a and b, then the center of mass is 
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If we extend the beam to infinity, we get 


because / f(x) dx = 1. In the center of mass interpretation, this integral is the total 


mass of the beam, which is always 1 when f is a probability density function. 


EXAMPLE 9.8.7 ‘The mean of the standard normal distribution is 


and 
i 
V/2T , 


— im fa eee eis 
0 V2 Doo V2 


The sum of these is 0, which is the mean. 


While the mean is very useful, it typically is not enough information to properly 
evaluate a situation. For example, suppose we could manufacture an 11-sided die, with 
the faces numbered 2 through 12 so that each face is equally likely to be down when the die 
is rolled. The value of a roll is the value on this lower face. Rolling the die gives the same 
range of values as rolling two ordinary dice, but now each value occurs with probability 
1/11. The expected value of a roll is 


a ae ree 
it." 4 i 
The mean does not distinguish the two cases, though of course they are quite different. 
If f is a probability density function for a random variable X, with mean pz, we would 
like to measure how far a “typical” value of X is from 4. One way to measure this distance 
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is (X — 1)*; we square the difference so as to measure all distances as positive. To get the 
typical such squared distance, we compute the mean. For two dice, for example, we get 
1 2 6 2 1 30 
2—7)9?— +(38-7°%— +---+ (7-77 4+--- (1-77 24+ (12-77 = =. 

( "36 7 | "367 al "367 ( "36 | "36 36 
Because we squared the differences this does not directly measure the typical distance we 
seek; if we take the square root of this we do get such a measure, \/35/36 ~ 2.42. Doing 
the computation for the strange 11-sided die we get 


21 2 1 21 21 21 

(2 —7) 1te-% eee a eee ae ee) qt Ae?) i = 10; 
with square root approximately 3.16. Comparing 2.42 to 3.16 tells us that the two-dice 
rolls clump somewhat more closely near 7 than the rolls of the weird die, which of course 
we already knew because these examples are quite simple. 

To perform the same computation for a probability density function the sum is replaced 
by an integral, just as in the computation of the mean. The expected value of the squared 
distances is 


called the variance. The square root of the variance is the standard deviation, denoted 
0. 


EXAMPLE 9.8.8 We compute the standard deviation of the standard normal distru- 
bution. The variance is 
1 oe 2 — 2/2 
— ge de: 
V 20 i. 


To compute the antiderivative, use integration by parts, with u = x and dv = ve ® /2 dy, 
This gives 


joann dz = —re~® /? + peer da. 


We cannot do the new integral, but we know its value when the limits are —oo to oo, from 
our discussion of the standard normal distribution. Thus 


a ES 
26 x /2 de = — 


1 oe 1 
— x 
/ 27 i: V2 


LE 


The standard deviation is then V1 = 1. 
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EXAMPLE 9.8.9 Here is a simple example showing how these ideas can be useful. 
Suppose it is known that, in the long run, 1 out of every 100 computer memory chips 
produced by a certain manufacturing plant is defective when the manufacturing process is 
running correctly. Suppose 1000 chips are selected at random and 15 of them are defective. 
This is more than the ‘expected’ number (10), but is it so many that we should suspect 
that something has gone wrong in the manufacturing process? We are interested in the 
probability that various numbers of defective chips arise; the probability distribution is 
discrete: there can only be a whole number of defective chips. But (under reasonable 
assumptions) the distribution is very close to a normal distribution, namely this one: 


2 
~ Vm /1000(.01)(.99)  \2(1000)(.01)(.99) ) ’ 


f(x) 


which is pictured in figure 9.8.3 (recall that exp(x) = e”). 


0.15 5 
0.10 = 
0.05 


0 5 10 15 20 25 


Figure 9.8.3 Normal density function for the defective chips example. 


Now how do we measure how unlikely it is that under normal circumstances we would 
see 15 defective chips? We can’t compute the probability of exactly 15 defective chips, as 
15 15.5 
this would be f(x) dx = 0. We could compute | f(x) dx © 0.036; this means there 
4 


15 4.5 
is only a 3.6% chance that the number of defective chips is 15. (We cannot compute these 


integrals exactly; computer software has been used to approximate the integral values in 
10.5 


this discussion.) But this is misleading: i f(x) dx = 0.126, which is larger, certainly, 
9.5 
but still small, even for the “most likely” outcome. The most useful question, in most 


circumstances, is this: how likely is it that the number of defective chips is “far from” 
the mean? For example, how likely, or unlikely, is it that the number of defective chips 
is different by 5 or more from the expected value of 10? This is the probability that the 
number of defective chips is less than 5 or larger than 15, namely 


Co 


5 
‘| f(x) da+ f(x) dx = 0.11. 
—oo 15 


So there is an 11% chance that this happens—not large, but not tiny. Hence the 15 
defective chips does not appear to be cause for alarm: about one time in nine we would 
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expect to see the number of defective chips 5 or more away from the expected 10. How 
about 20? Here we compute 


(oe) 


0 
/ f(a) dx+ f(x) dx ~ 0.0015. 
—oco 20 


So there is only a 0.15% chance that the number of defective chips is more than 10 away 
from the mean; this would typically be interpreted as too suspicious to ignore—it shouldn’t 
happen if the process is running normally. 

The big question, of course, is what level of improbability should trigger concern? 
It depends to some degree on the application, and in particular on the consequences of 
getting it wrong in one direction or the other. If we’re wrong, do we lose a little money? 
A lot of money? Do people die? In general, the standard choices are 5% and 1%. So what 
we should do is find the number of defective chips that has only, let us say, a 1% chance 
of occurring under normal circumstances, and use that as the relevant number. In other 


words, we want to know when 


10—r oo 
a f(x) a+ [ fle) dx < 0.01. 
—oo 10+r 
A bit of trial and error shows that with r = 8 the value is about 0.011, and with r = 9 it 
is about 0.004, so if the number of defective chips is 19 or more, or 1 or fewer, we should 
look for problems. If the number is high, we worry that the manufacturing process has 
a problem, or conceivably that the process that tests for defective chips is not working 
correctly and is flagging good chips as defective. If the number is too low, we suspect that 


the testing procedure is broken, and is not detecting defective chips. 


Exercises 9.8. 


1. Verify that / e*!? da = 2/rV/e. 
i 
2. Show that the function in example 9.8.5 is a probability density function. Compute the mean 


and standard deviation. => 


3. Compute the mean and standard deviation of the uniform distribution on [a,b]. (See exam- 
ple 9.8.3.) => 


4. What is the expected value of one roll of a fair six-sided die? => 
5. What is the expected sum of one roll of three fair six-sided dice? => 
6. Let yu and o be real numbers with o > 0. Show that 


1 => 
OU og 


is a probability density function. You will not be able to compute this integral directly; use 
a substitution to convert the integral into the one from example 9.8.4. The function N is 
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the probability density function of the normal distribution with mean py and standard 
deviation 0. Show that the mean of the normal distribution is 4 and the standard deviation 
is o. 
7. Let i 
f(x) = mee ae 
0 a<l 
Show that f is a probability density function, and that the distribution has no mean. 


8. Let 
xa -l<a<l 
wey= {1 lee<3 


O otherwise. 
Show that / f(x) dx = 1. Is f a probability density function? Justify your answer. 


9. If you have access to appropriate software, find r so that 


10-—r lo) 
/ f(x) da + | f(x) dx = 0.05, 


—co 10+r 
using the function of example 9.8.9. Discuss the impact of using this new value of r to decide 
whether to investigate the chip manufacturing process. > 


9.9 ARc LENGTH 


Here is another geometric application of the integral: find the length of a portion of a 
curve. As usual, we need to think about how we might approximate the length, and turn 
the approximation into an integral. 

We already know how to compute one simple arc length, that of a line segment. If the 
endpoints are Po(Xo, yo) and Pi(x1,y1) then the length of the segment is the distance be- 


tween the points, \/(21 — 20)? + (y: — yo)2, from the Pythagorean theorem, as illustrated 
in figure 9.9.1. 


(x1, 41) 


J («1 — £9)? + (y1 — yo)? 


Y1 — YO 


(x0, Yo) ry — xO 


Figure 9.9.1 The length of a line segment. 
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Figure 9.9.2 Approximating arc length with line segments. 


Now if the graph of f is “nice” (say, differentiable) it appears that we can approximate 
the length of a portion of the curve with line segments, and that as the number of segments 
increases, and their lengths decrease, the sum of the lengths of the line segments will 
approach the true arc length; see figure 9.9.2. 

Now we need to write a formula for the sum of the lengths of the line segments, in a 
form that we know becomes an integral in the limit. So we suppose we have divided the 
interval [a, b|] into n subintervals as usual, each with length Ax = (b—a)/n, and endpoints 
@ = X0, X1, 22, .--, In = b. The length of a typical line segment, joining (x;, f(z;)) 
to (#41, f(ti41)), is V (Ax)? + (f(ai41) — f(xi))2. By the Mean Value Theorem (6.5.2), 
there is a number ¢; in (%;,2;41) such that f’(t;)Av = f(x;41) — f(a;), so the length of 
the line segment can be written as 


JV (Aa)? + (f'(ti))2Aa? = \/1 + (f'(ti))? Ac. 


The arc length is then 


n-1 b 
jim YO VIF PP Ae = f VIF FOP ae. 
i=0 a 


Note that the sum looks a bit different than others we have encountered, because the 
approximation contains at; instead of an x;. In the past we have always used left endpoints 
(namely, x;) to get a representative value of f on [x;, 2:41]; now we are using a different 
point, but the principle is the same. 

To summarize, to compute the length of a curve on the interval [a,b], we compute the 
integral 


b 
/ J 1+ (f'(a))? dx. 


Unfortunately, integrals of this form are typically difficult or impossible to compute exactly, 
because usually none of our methods for finding antiderivatives will work. In practice this 
means that the integral will usually have to be approximated. 
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EXAMPLE 9.9.1 Let f(x) = Vr? — x?, the upper half circle of radius r. The length 
of this curve is half the circumference, namely ar. Let’s compute this with the arc length 


formula. The derivative f’ is —x/./r? — x? so the integral is 


Tr 72 i r2 Tr 1 
[itastee-f yetee] Jae 


Using a trigonometric substitution, we find the antiderivative, namely arcsin(x/r). Notice 


that the integral is improper at both endpoints, as the function \/1/(r? — x?) is undefined 
when x = tr. So we need to compute 


oF al oy: a 
lim, ff r2 — g? ae+ jim f r2 — g? si 


This is not difficult, and has value 7, so the original integral, with the extra r in front, has 


value mr as expected. 


Exercises 9.9. 

Find the arc length of f(x) = «°/? on [0,2]. > 

Find the arc length of f(x) = «”/8 — Ina on [1,2]. > 

Find the arc length of f(x) = (1/3)(x? + 2)°”? on the interval (0, a]. > 
Find the arc length of f(x) = In(sin x) on the interval [7/4,7/3]. => 


2 ee ee 


Let a > 0. Show that the length of y = cosh on [0,a] is equal to / cosh x dz. 
0 


6. Find the arc length of f(x) = coshz on [0,In2]. > 


7. Set up the integral to find the arc length of sina on the interval [0,7]; do not evaluate the 
integral. If you have access to appropriate software, approximate the value of the integral. 
=> 


8. Set up the integral to find the arc length of y = xe * on the interval [2,3]; do not evaluate the 
integral. If you have access to appropriate software, approximate the value of the integral. 
=> 


9. Find the arc length of y = e” on the interval [0,1]. (This can be done exactly; it is a bit 
tricky and a bit long.) > 


9.10 SURFACE AREA 


Another geometric question that arises naturally is: “What is the surface area of a vol- 
ume?” For example, what is the surface area of a sphere? More advanced techniques 
are required to approach this question in general, but we can compute the areas of some 
volumes generated by revolution. 
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As usual, the question is: how might we approximate the surface area? For a surface 
obtained by rotating a curve around an axis, we can take a polygonal approximation to 
the curve, as in the last section, and rotate it around the same axis. This gives a surface 
composed of many “truncated cones;” a truncated cone is called a frustum of a cone. 
Figure 9.10.1 illustrates this approximation. 


Figure 9.10.1 Approximating a surface (left) by portions of cones (right). 


So we need to be able to compute the area of a frustum of a cone. Since the frustum 
can be formed by removing a small cone from the top of a larger one, we can compute 
the desired area if we know the surface area of a cone. Suppose a right circular cone has 
base radius r and slant height h. If we cut the cone from the vertex to the base circle 
and flatten it out, we obtain a sector of a circle with radius h and arc length 27r, as in 
figure 9.10.2. The angle at the center, in radians, is then 2ar/h, and the area of the cone 
is equal to the area of the sector of the circle. Let A be the area of the sector; since the 
area of the entire circle is 7h”, we have 


A | 2arfh 
th2 = @r 
A=mrh. 


Now suppose we have a frustum of a cone with slant height h and radii rg and rj, as 
in figure 9.10.3. The area of the entire cone is mr 1(ho +h), and the area of the small cone 
is Troho; thus, the area of the frustum is mr1(ho + h) — aroho = T((r1 — To) ho + Tih). By 
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Figure 9.10.2 The area of a cone. 


similar triangles, 
ho = ho th 


TO ry 
With a bit of algebra this becomes (r; — ro)ho = roh; substitution into the area gives 


ula) h = 2rrh. 


T((T1 —To)ho + r1h) = a(roh + rih) = th(ro + 171) = 20 


The final form is particularly easy to remember, with r equal to the average of rp and ry, 
as it is also the formula for the area of a cylinder. (Think of a cylinder of radius r and 


height h as the frustum of a cone of infinite height.) 


ho 


Figure 9.10.3. The area of a frustum. 


Now we are ready to approximate the area of a surface of revolution. On one subin- 
terval, the situation is as shown in figure 9.10.4. When the line joining two points on the 
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curve is rotated around the x-axis, it forms a frustum of a cone. The area is 


arr = ant ea) a i) EE) Ac. 


Here \/1+(f’(t;))? Ax is the length of the line segment, as we found in the previous 
section. Assuming f is a continuous function, there must be some x7 in [x;,2;41] such 
that (f(a;) + f(vi41))/2 = f(x7), so the approximation for the surface area is 


Sans )f1+(f 2 Ax. 


This is not quite the sort of sum we have seen before, as it contains two different values 
in the interval |x;,7;41], namely x; and t;. Nevertheless, using more advanced techniques 
than we have available here, it turns out that 


lim Sons Jl + (fi (ti Pae= fo Qn f(x)/1 + (f'(x))? dx 


is the surface area we seek. (Roughly speaking, this is because while x; and t; are distinct 
values in |x;,7;41], they get closer and closer to each other as the length of the interval 


shrinks. ) 
A (zi41, f(@i41)) 
(wi, Fea) 7 | 


* 
Ly 2; Li4d 


Figure 9.10.4 One subinterval. 


EXAMPLE 9.10.1 Wecompute the surface area of a sphere of radius r. The sphere can 
be obtained by rotating the graph of f(a) = Vr? — x? about the z-axis. The derivative 
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f' is —x/Vr? — x?, so the surface area is given by 


Tr 2 
A=2n f Vr2 — x2 aoe 
3 r2—@ 


r 2 
=2n [ Vr? — x2 ——, de 
- Vr2—9 
=2n [ rde=2nr 1 dx = 4rr? 


If the curve is rotated around the y axis, the formula is nearly identical, because the 


length of the line segment we use to approximate a portion of the curve doesn’t change. 
Instead of the radius f(x; ), we use the new radius Z; = (x; +27;41)/2, and the surface area 
integral becomes 


b 
/ Smale ay? de 


EXAMPLE 9.10.2 Compute the area of the surface formed when f(x) = x? between 
O and 2 is rotated around the y-axis. 
We compute f’(x) = 2z, and then 


2 
an f V1+ 4x? dx = ar —1), 
0 


by a simple substitution. 


Exercises 9.10. 


1. Compute the area of the surface formed when f(a) = 2/1 — x between —1 and 0 is rotated 
around the z-axis. > 


2. Compute the surface area of example 9.10.2 by rotating f(x) = \/x around the z-axis. 


3. Compute the area of the surface formed when f(x) = x°® between 1 and 3 is rotated around 
the x-axis. > 


4. Compute the area of the surface formed when f(x) = 2+ cosh(x) between 0 and 1 is rotated 
around the z-axis. > 


5. Consider the surface obtained by rotating the graph of f(a) = 1/2, x > 1, around the z-axis. 
This surface is called Gabriel’s horn or Toricelli’s trumpet. In exercise 13 in section 9.7 
we saw that Gabriel’s horn has finite volume. Show that Gabriel’s horn has infinite surface 
area. 


6. Consider the circle (x — 2)? + y? = 1. Sketch the surface obtained by rotating this circle 
about the y-axis. (The surface is called a torus.) What is the surface area? > 


7. 


8. 
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Consider the ellipse with equation x” /4+ y’ = 1. If the ellipse is rotated around the x-axis 
it forms an ellipsoid. Compute the surface area. > 

Generalize the preceding result: rotate the ellipse given by x Fi ao y” i b” = 1 about the 
x-axis and find the surface area of the resulting ellipsoid. You should consider two cases, 
when a > 6 and when a < b. Compare to the area of a sphere. => 


10 


Polar Coordinates, 
Parametric Equations 


10.1 POLAR COORDINATES 


Coordinate systems are tools that let us use algebraic methods to understand geometry. 
While the rectangular (also called Cartesian) coordinates that we have been using are 
the most common, some problems are easier to analyze in alternate coordinate systems. 

A coordinate system is a scheme that allows us to identify any point in the plane or 
in three-dimensional space by a set of numbers. In rectangular coordinates these numbers 
are interpreted, roughly speaking, as the lengths of the sides of a rectangle. In polar 
coordinates a point in the plane is identified by a pair of numbers (r, 0). The number 0 
measures the angle between the positive x-axis and a ray that goes through the point, as 
shown in figure 10.1.1; the number r measures the distance from the origin to the point. 
Figure 10.1.1 shows the point with rectangular coordinates (1, V3) and polar coordinates 
(2,7/3), 2 units from the origin and 7/3 radians from the positive x-axis. 


V3 7 (2, 7/3) 


1 


Figure 10.1.1 Polar coordinates of the point (1, V3). 
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Just as we describe curves in the plane using equations involving x« and y, so can we 
describe curves using equations involving r and 8. Most common are equations of the form 


r= 7), 


EXAMPLE 10.1.1 Graph the curve given by r = 2. All points with r = 2 are at 
distance 2 from the origin, so r = 2 describes the circle of radius 2 with center at the 


origin. 


EXAMPLE 10.1.2 Graph the curve given by r = 1+ cos@. We first consider y = 
1+ cos, as in figure 10.1.2. As @ goes through the values in [0, 27], the value of r tracks 
the value of y, forming the “cardioid” shape of figure 10.1.2. For example, when 6 = 7/2, 
r = 1+ cos(z/2) = 1, so we graph the point at distance 1 from the origin along the 
positive y-axis, which is at an angle of 7/2 from the positive z-axis. When 0 = 77/4, 
r= 1+ cos(77/4) = 14+ V2/2 © 1.71, and the corresponding point appears in the fourth 
quadrant. This illustrates one of the potential benefits of using polar coordinates: the 


equation for this curve in rectangular coordinates would be quite complicated. 


m/2 1 an /2 Qa 
Figure 10.1.2 A cardioid: y= 1-+cosz on the left, r = 1+ cos@ on the right. 


Each point in the plane is associated with exactly one pair of numbers in the rect- 
angular coordinate system; each point is associated with an infinite number of pairs in 
polar coordinates. In the cardioid example, we considered only the range 0 < @ < 27, 
and already there was a duplicate: (2,0) and (2,27) are the same point. Indeed, every 
value of @ outside the interval [0,27) duplicates a point on the curve r = 1+ cos@ when 
0 < @ < 27. We can even make sense of polar coordinates like (—2,7/4): go to the direc- 
tion 7/4 and then move a distance 2 in the opposite direction; see figure 10.1.3. As usual, 
a negative angle 6 means an angle measured clockwise from the positive x-axis. The point 
in figure 10.1.3 also has coordinates (2, 57/4) and (2, —37/4). 

The relationship between rectangular and polar coordinates is quite easy to under- 
stand. The point with polar coordinates (r,@) has rectangular coordinates x = rcos@ 
and y = rsin@; this follows immediately from the definition of the sine and cosine func- 
tions. Using figure 10.1.3 as an example, the point shown has rectangular coordinates 


10.1 Polar Coordinates 241 


2 Ss 
1 / 
/ 
[<7 
—2 -1 7% 1 2 
7 
/ 
—1 
4 
_2 1 


Figure 10.1.3. The point (—2, 7/4) = (2, 52/4) = (2, —37/4) in polar coordinates. 


a = (—2)cos(1/4) = —V2 = 1.4142 and y = (—2)sin(1/4) = —V2. This makes it very 
easy to convert equations from rectangular to polar coordinates. 


EXAMPLE 10.1.3 Find the equation of the line y = 3x + 2 in polar coordinates. We 
2 


sin 8 — 3cos0° 


merely substitute: rsin@ = 3rcos@ + 2, or r = 


EXAMPLE 10.1.4 Find the equation of the circle (x — 1/2)? + y* = 1/4 in polar 
coordinates. Again substituting: (rcos@ — 1/2)? +r? sin? @ = 1/4. A bit of algebra turns 
this into r = cos(t). You should try plotting a few (r,@) values to convince yourself that 


this makes sense. 


EXAMPLE 10.1.5 Graph the polar equation r = 6. Here the distance from the origin 
exactly matches the angle, so a bit of thought makes it clear that when 6 > 0 we get the 
spiral of Archimedes in figure 10.1.4. When @ < 0, r is also negative, and so the full graph 


is the right hand picture in the figure. 


(7/2, 7/2) (—1/2, —1/2) 


(w, 7) 


Figure 10.1.4 ‘The spiral of Archimedes and the full graph of r = @. 


Converting polar equations to rectangular equations can be somewhat trickier, and 
graphing polar equations directly is also not always easy. 
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EXAMPLE 10.1.6 Graph r = 2sin@. Because the sine is periodic, we know that we 
will get the entire curve for values of 6 in [0,27). As @ runs from 0 to 7/2, r increases 
from 0 to 2. Then as 6 continues to 7, r decreases again to 0. When @ runs from 7 to 
27, r is negative, and it is not hard to see that the first part of the curve is simply traced 
out again, so in fact we get the whole curve for values of @ in [0,7). Thus, the curve looks 
something like figure 10.1.5. Now, this suggests that the curve could possibly be a circle, 
and if it is, it would have to be the circle 2? + (y — 1)? = 1. Having made this guess, we 
can easily check it. First we substitute for x and y to get (rcos6)? + (rsin@ — 1)? = 1; 


expanding and simplifying does indeed turn this into r = 2sin@. 


—1 0 1 
Figure 10.1.5 Graph of r = 2sin0. 
Exercises 10.1. 


1. Plot these polar coordinate points on one graph: (2,7/3), (—3,a/2), (—2,—7/4), (1/2,7), 
(1, 47/3), (0, 37/2). 
Find an equation in polar coordinates that has the same graph as the given equation in 
rectangular coordinates. 


2. y=3r > 3. y=-4> 

4. cy =1> 5B. oe ty=5S5 
6. y=r=> 7 y=snzr> 

8 y=or4+2> 9. 22> 

10. y=2?4+1> 11. y=32?-22 > 
12. y=u't+y’ => 
Sketch the curve. 

13. r=cosé 14. r=sin(6+ 7/4) 
15. r=-—secd 16. r=0/2,0>0 
17. r=146'/n’ 18. r=cotécscd 
19. : 20. r? = —2secO csc 


= sin 9 + cos 6 
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In the exercises below, find an equation in rectangular coordinates that has the same graph as 
the given equation in polar coordinates. 


21. 7 
23. r=secOcscO > 24. r=tand=> 


sin(30) > 22. r=sin’@=> 


10.2 SLOPES IN POLAR COORDINATES 


When we describe a curve using polar coordinates, it is still a curve in the x-y plane. We 
would like to be able to compute slopes and areas for these curves using polar coordinates. 
We have seen that « = rcos@ and y = rsin@ describe the relationship between polar 
and rectangular coordinates. If in turn we are interested in a curve given by r = f(0), 
then we can write x = f(@)cos@ and y = f(@) sin, describing x and y in terms of 6 alone. 
The first of these equations describes # implicitly in terms of x, so using the chain rule we 
may compute 
dy  dyddo 
dx d0 da’ 
Since d@/dx = 1/(dx/d0), we can instead compute 


dy _dy/d0 __ f(@)cos@ + f’(6) sin® 
dx dx/d0  —f(0)sin@+ f’(0)cos0 


EXAMPLE 10.2.1 Find the points at which the curve given by r = 1+ cos@ has a 
vertical or horizontal tangent line. Since this function has period 27, we may restrict our 
attention to the interval [0, 27) or (—7, 7], as convenience dictates. First, we compute the 
slope: 

dy (1 + cos 0) cos 6 — sin @ sin 0 cos @ + cos? 6 — sin? @ 


dx —(1+cos@)sin@—sin@cos@  —sin@—2sin@cosé 


This fraction is zero when the numerator is zero (and the denominator is not zero). The 
numerator is 2cos? 6 + cos@ — 1 so by the quadratic formula 


-14 V144-2 | 1 


m —l or 5" 


This means 0 is 7 or +7/3. However, when 6 = 7, the denominator is also 0, so we cannot 


cos @ = 


conclude that the tangent line is horizontal. 
Setting the denominator to zero we get 


—sin@ — 2sin@cos@é = 0 
sin 0(1 + 2cos6@) = 0, 


so either sin? = 0 or cos@ = —1/2. The first is true when 6 is 0 or 7, the second when 0 
is 27/3 or 47/3. However, as above, when 0 = 7, the numerator is also 0, so we cannot 
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conclude that the tangent line is vertical. Figure 10.2.1 shows points corresponding to 0 
equal to 0, +7/3, 27/3 and 47/3 on the graph of the function. Note that when 0 = z the 
curve hits the origin and does not have a tangent line. 


Figure 10.2.1 Points of vertical and horizontal tangency for r = 1+ cos@. 


We know that the second derivative f’(x) is useful in describing functions, namely, 
in describing concavity. We can compute f(x) in terms of polar coordinates as well. We 
already know how to write dy/dz = y’ in terms of 0, then 


ddy dy’ dy’ dé _ dy'/dé 
dxdx dx d0dx_ dzx/do 


EXAMPLE 10.2.2 We find the second derivative for the cardioid r = 1 + cos @: 


d cos 6 + cos? @ — sin? 0 ae 3(1 + cos @) 1 

dd —sin0d—2sin@cosO dz/d@  ~—(sin@ + 2sin@cos8)2_ —(sin@ + 2sin@ cos @) 
7 —3(1 + cos @) 
(sind + 2sin @ cos @)3 


The ellipsis here represents rather a substantial amount of algebra. We know from above 
that the cardioid has horizontal tangents at +7 /3; substituting these values into the second 
derivative we get y’(a/3) = —V3/2 and y”(—2/3) = V3/2, indicating concave down and 
concave up respectively. This agrees with the graph of the function. 


Exercises 10.2. 
Compute y’ = dy/dx and y" = d?y/dz’. 

1 r=0> 2. r=1+sind= 
3. r=cosd=> 4. r=sind=> 

5. r=sec0 > 6. r=sin(20) > 
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Sketch the curves over the interval [0,27] unless otherwise stated. 


7. r=sin@+cosé 8. r=2+42sin0 
9. r=5+sind 10. r=2+cosé 
1 
i. r= 5 + 0088 12. r=cos(0/2),0<0<4n 
13. r=sin(0/3),0 <0 < 6x 14. r=sin’6 
15. r=1+cos*(20) 16. r=sin’(36) 
17. r=tan? 18. r=sec(6/2),0<0< 47 
19. r=1+sec0 20. r= pues. eee 
1 — cosé 
1 

21. = 22. = cot(20 

7 1+ siné eee) 


24. f= n7/0,050 < ce 24. r=14+7/0,0<0<c 
25. r= V/7/0,0<0<c 


10.3 AREAS IN POLAR COORDINATES 


We can use the equation of a curve in polar coordinates to compute some areas bounded 
by such curves. The basic approach is the same as with any application of integration: find 
an approximation that approaches the true value. For areas in rectangular coordinates, we 
approximated the region using rectangles; in polar coordinates, we use sectors of circles, 
as depicted in figure 10.3.1. Recall that the area of a sector of a circle is ar?/2, where 
a is the angle subtended by the sector. If the curve is given by r = f(0), and the angle 
subtended by a small sector is A@, the area is (A0)(f(0))?/2. Thus we approximate the 
total area as 


In the limit this becomes 


b 
1 
/ 5 (0)? a0. 


EXAMPLE 10.3.1 We find the area inside the cardioid r = 1 + cos 0. 


@  sin20.|*" 3 
(6+ 2sind+ 54 — )} = . 


27 1 i] 20 
| ~(1+cos 0)? dO = =| 1+2 cos 6+cos? 6 dO = 
0 2 2 Jo 


Nl rR 


0 


EXAMPLE 10.3.2 We find the area between the circles r = 2 and r = 4sin@, as shown 
in figure 10.3.2. The two curves intersect where 2 = 4sin6@, or sin? = 1/2, so 0 = 7/6 or 
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Figure 10.3.1 Approximating area by sectors of circles. 
57/6. The area we want is then 


51/6 4 
=| 16 sin? @—4d6 = —7 + 2V3. 
r/6 3 


Figure 10.3.2 An area between curves. 


This example makes the process appear more straightforward than it is. Because 
points have many different representations in polar coordinates, it is not always so easy to 
identify points of intersection. 


EXAMPLE 10.3.3 We find the shaded area in the first graph of figure 10.3.3 as the 
difference of the other two shaded areas. The cardioid is r = 1 + sin@ and the circle is 
r = 3sin0. We attempt to find points of intersection: 


1+sin@é = 3sin0 
1=2sin0 
1/2 =sin 0. 


This has solutions 0 = 7/6 and 57/6; 7/6 corresponds to the intersection in the first quad- 
rant that we need. Note that no solution of this equation corresponds to the intersection 
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point at the origin, but fortunately that one is obvious. The cardioid goes through the 


origin when 6 = —7/2; the circle goes through the origin at multiples of 7, starting with 
0. 


Now the larger region has area 


n/6 
z itd got 4 
( ) 


29 2 16 
and the smaller has area 
ar /6 3 9 
5 f (3sin6)? dd = —" - V3 
0 8 16 


so the area we seek is 7/8. 


Figure 10.3.3. An area between curves. 


Ezercises 10.3. 


Find the area enclosed by the curve. 
1. r=Vsinéd => 2. r=2+cosé => 
3. r=secd,7/6 <0 < 24/3 => 4. r=cos#,0<0<27/3> 
5. r=2acosé,a>0=> 6. r=4+3sind => 
7. Find the area inside the loop formed by r = tan(0/2). => 
8. Find the area inside one loop of r = cos(30). => 
9. Find the area inside one loop of r = sin? 0. > 
10. Find the area inside the small loop of r = (1/2) + cos. > 
11. Find the area inside r = (1/2) + cos 6, including the area inside the small loop. > 
12. Find the area inside one loop of r? = cos(20). > 


csc 6 


a 


13. Find the area enclosed by r = tané@ and r = 
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14. Find the area inside r = 2cos@ and outside r= 1. > 

15. Find the area inside r = 2sin@ and above the line r = (3/2) csc 0. > 
16. Find the area inside r=0,0<0< 27. => 

17. Find the area inside r = V6, 0<0<27.> 

18. Find the area inside both r = V3cos@ and r = sind. > 

19. Find the area inside both r = 1—cos@ and r=cosé. > 


20. The center of a circle of radius 1 is on the circumference of a circle of radius 2. Find the area 
of the region inside both circles. => 


21. Find the shaded area in figure 10.3.4. The curve is r= 0,0<0< 37. > 


‘] 
\ 


Figure 10.3.4 An area bounded by the spiral of Archimedes. 


10.4 PARAMETRIC EQUATIONS 


When we computed the derivative dy/dx using polar coordinates, we used the expressions 
x = f(@)cos@ and y = f(@)sin@. These two equations completely specify the curve, 
though the form r = f(0) is simpler. The expanded form has the virtue that it can easily 
be generalized to describe a wider range of curves than can be specified in rectangular or 
polar coordinates. 

Suppose f(t) and g(t) are functions. Then the equations x = f(t) and y = g(t) 
describe a curve in the plane. In the case of the polar coordinates equations, the variable 
t is replaced by 6 which has a natural geometric interpretation. But t¢ in general is simply 
an arbitrary variable, often called in this case a parameter, and this method of specifying 
a curve is known as parametric equations. One important interpretation of t is time. 
In this interpretation, the equations x = f(t) and y = g(t) give the position of an object 
at time f. 
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EXAMPLE 10.4.1 Describe the path of an object that moves so that its position at 
time t is given by x = cost, y = cos*t. We see immediately that y = x”, so the path lies 
on this parabola. The path is not the entire parabola, however, since x = cost is always 
between —1 and 1. It is now easy to see that the object oscillates back and forth on the 


parabola between the endpoints (1,1) and (—1, 1), and is at point (1,1) at time t = 0. 


It is sometimes quite easy to describe a complicated path in parametric equations 
when rectangular and polar coordinate expressions are difficult or impossible to devise. 


EXAMPLE 10.4.2 A wheel of radius 1 rolls along a straight line, say the z-axis. A 
point on the rim of the wheel will trace out a curve, called a cycloid. Assume the point 
starts at the origin; find parametric equations for the curve. 

Figure 10.4.1 illustrates the generation of the curve (click on the AP link to see an 
animation). The wheel is shown at its starting point, and again after it has rolled through 
about 490 degrees. We take as our parameter ¢ the angle through which the wheel has 
turned, measured as shown clockwise from the line connecting the center of the wheel 
to the ground. Because the radius is 1, the center of the wheel has coordinates (t,1). 
We seek to write the coordinates of the point on the rim as (t + Az,1+ Ay), where 
Ax and Ay are as shown in figure 10.4.2. These values are nearly the sine and cosine 
of the angle t, from the unit circle definition of sine and cosine. However, some care is 
required because we are measuring t from a nonstandard starting line and in a clockwise 
direction, as opposed to the usual counterclockwise direction. A bit of thought reveals 
that Ar = —sint and Ay = —cost. Thus the parametric equations for the cycloid are 


f=7=sint, y= 1— cost. 


Figure 10.4.1 A cycloid. (AP) 


aN 


Figure 10.4.2 The wheel. 
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Exercises 10.4. 


1. 


8. 


What curve is described by x = t’, y = t*? If t is interpreted as time, describe how the 
object moves on the curve. 


. What curve is described by x = 3cost, y = 3sint? If t is interpreted as time, describe how 


the object moves on the curve. 


. What curve is described by x = 3cost, y = 2sint? If t is interpreted as time, describe how 


the object moves on the curve. 


. What curve is described by x = 3sint, y = 3cost? If t is interpreted as time, describe how 


the object moves on the curve. 


. Sketch the curve described by « = t® —t, y = t’. If t is interpreted as time, describe how the 


object moves on the curve. 


. A wheel of radius 1 rolls along a straight line, say the x-axis. A point P is located halfway 


between the center of the wheel and the rim; assume P starts at the point (0,1/2). As the 
wheel rolls, P traces a curve; find parametric equations for the curve. > 


. A wheel of radius 1 rolls around the outside of a circle of radius 3. A point P on the rim of 


the wheel traces out a curve called a hypercycloid, as indicated in figure 10.4.3. Assuming 
P starts at the point (3,0), find parametric equations for the curve. > 


Figure 10.4.3. A hypercycloid and a hypocycloid. 


A wheel of radius 1 rolls around the inside of a circle of radius 3. A point P on the rim of 
the wheel traces out a curve called a hypocycloid, as indicated in figure 10.4.3. Assuming 
P starts at the point (3,0), find parametric equations for the curve. > 


. An involute of a circle is formed as follows: Imagine that a long (that is, infinite) string is 


wound tightly around a circle, and that you grasp the end of the string and begin to unwind 
it, keeping the string taut. The end of the string traces out the involute. Find parametric 
equations for this curve, using a circle of radius 1, and assuming that the string unwinds 
counter-clockwise and the end of the string is initially at (1,0). Figure 10.4.4 shows part of 
the curve; the dotted lines represent the string at a few different times. > 
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Figure 10.4.4 An involute of a circle. 


10.5 CALCULUS WITH PARAMETRIC EQUATIONS 


We have already seen how to compute slopes of curves given by parametric equations—it 
is how we computed slopes in polar coordinates. 


EXAMPLE 10.5.1 Find the slope of the cycloid x = t—sint, y = 1—cost. We compute 
x’ =1—cost, y’ =sint, so 

dy __—sint 

dx  1—cost 


Note that when ¢t is an odd multiple of 7, like a or 37, this is (0/2) = 0, so there is 
a horizontal tangent line, in agreement with figure 10.4.1. At even multiples of 7, the 
fraction is 0/0, which is undefined. The figure shows that there is no tangent line at such 


points. 


Areas can be a bit trickier with parametric equations, depending on the curve and the 
area desired. We can potentially compute areas between the curve and the x-axis quite 
easily. 


EXAMPLE 10.5.2 Find the area under one arch of the cycloid x = t—sint, y = 1—cost. 
We would like to compute 
27 
ii y dx, 
0 


but we do not know y in terms of x. However, the parametric equations allow us to make 
a substitution: use y = 1 — cost to replace y, and compute dx = (1 — cost) dt. Then the 
integral becomes 


27 
| (1 — cost)(1 — cost) dt = 37. 
0 
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Note that we need to convert the original x limits to ¢ limits using « = t — sint. When 
x = 0, t = sint, which happens only when ¢ = 0. Likewise, when x = 27, t — 27 = sint 
and t = 27. Alternately, because we understand how the cycloid is produced, we can see 
directly that one arch is generated by 0 < t < 27. In general, of course, the ¢ limits will 
be different than the x limits. 


This technique will allow us to compute some quite interesting areas, as illustrated by 
the exercises. 

As a final example, we see how to compute the length of a curve given by parametric 
equations. Section 9.9 investigates arc length for functions given as y in terms of x, and 


b 2 
dy 
1 —|]d 
: y* (=) : 
Using some properties of oe a the chain rule, we can convert this to use 
parametric equations 7 = oe — at 
(<*) dy dt 
\/1 d. 
[ +( w= fy ( We (2) @) da 
= dx \? (4 
= — —} dt 
[y( Me) 7) a 
-[ Ve Hy? at. 


Here u and v are the t limits corresponding to the x limits a and b. 


develops the formula for length: 


=. 


EXAMPLE 10.5.3 Find the length of one arch of the cycloid. From x = t — sint, 
y =1-—cost, we get the derivatives f’ = 1 — cost and g’ = sint, so the length is 


27 27 
i \/ (1 — cost)? + sin? ¢ dt = V2 —2cost dt. 
0 


0 


Now we use the formula sin?(t/2) = (1 — cos(t))/2 or 4sin?(t/2) = 2 — 2cost to get 


/ \/4sin?(t/2) dt 


Since 0 < t < 27, sin(t/2) > 0, so we can rewrite this as 


27 
| 2sin(t/2) dt = 8. 
0 
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Exercises 10.5. 


1. 


Consider the curve of exercise 6 in section 10.4. Find all values of t for which the curve has 
a horizontal tangent line. > 


. Consider the curve of exercise 6 in section 10.4. Find the area under one arch of the curve. 


=> 


. Consider the curve of exercise 6 in section 10.4. Set up an integral for the length of one arch 


of the curve. > 


. Consider the hypercycloid of exercise 7 in section 10.4. Find all points at which the curve 


has a horizontal tangent line. > 


. Consider the hypercycloid of exercise 7 in section 10.4. Find the area between the large circle 


and one arch of the curve. > 


. Consider the hypercycloid of exercise 7 in section 10.4. Find the length of one arch of the 


curve. => 


7. Consider the hypocycloid of exercise 8 in section 10.4. Find the area inside the curve. > 


10. 


11. 
12. 


. Consider the hypocycloid of exercise 8 in section 10.4. Find the length of one arch of the 


curve. => 


. Recall the involute of a circle from exercise 9 in section 10.4. Find the point in the first 


quadrant in figure 10.4.4 at which the tangent line is vertical. => 


Recall the involute of a circle from exercise 9 in section 10.4. Instead of an infinite string, 
suppose we have a string of length 7 attached to the unit circle at (—1,0), and initially laid 
around the top of the circle with its end at (1,0). If we grasp the end of the string and begin 
to unwind it, we get a piece of the involute, until the string is vertical. If we then keep the 
string taut and continue to rotate it counter-clockwise, the end traces out a semi-circle with 
center at (—1,0), until the string is vertical again. Continuing, the end of the string traces 
out the mirror image of the initial portion of the curve; see figure 10.5.1. Find the area of 
the region inside this curve and outside the unit circle. > 


Find the length of the curve from the previous exercise, shown in figure 10.5.1. > 
Find the length of the spiral of Archimedes (figure 10.3.4) for 0 <6 < 27. => 


Figure 10.5.1 A region formed by the end of a string. 
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Sequences and Series 


Consider the following sum: 


za ee eee 
2'°4° 8° 16 2i 


The dots at the end indicate that the sum goes on forever. Does this make sense? Can 
we assign a numerical value to an infinite sum? While at first it may seem difficult or 
impossible, we have certainly done something similar when we talked about one quantity 
getting “closer and closer” to a fixed quantity. Here we could ask whether, as we add more 
and more terms, the sum gets closer and closer to some fixed value. That is, look at 


al 
5 3D 

a | 

i ge 

/ ae ens | 
as 
ieee as ee 
16 2 4° 8 16 


and so on, and ask whether these values have a limit. It seems pretty clear that they do, 
namely 1. In fact, as we will see, it’s not hard to show that 
1 Ds eth - 2. Al 


Pie ee +.---o= 1 
Oe BG Qo Qt 2' 
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and then 
1 
lim 1—-— =1-—-0=1. 


i oo 2 
There is one place that you have long accepted this notion of infinite sum without really 
thinking of it as a sum: 


Oe ee 3 3 1 
0.33933 = 75 + T99 + tooo + T0000 t= 3? 
for example, or 
bs. 1 
3.14159... =3+—4——4 aE 3 i ea eae ss 


10 100 1000 = §=10000 ~=100000 


Our first task, then, to investigate infinite sums, called series, is to investigate limits of 
sequences of numbers. That is, we officially call 


rar ame 4 8 16 Qi 


a series, while 


is a sequence, and 


el 26 ol 
S- — = lim —, 
toe 2 1-00 2 


that is, the value of a series is the limit of a particular sequence. 


11.1 SEQUENCES 


While the idea of a sequence of numbers, a1, @2,a3,... is straightforward, it is useful to 
think of a sequence as a function. We have up until now dealt with functions whose domains 
are the real numbers, or a subset of the real numbers, like f(a) = sinz. A sequence is a 
function with domain the natural numbers N = {1,2,3,...} or the non-negative integers, 
Z=° = {0,1,2,3,...}. The range of the function is still allowed to be the real numbers; in 
symbols, we say that a sequence is a function f: N — R. Sequences are written in a few 
different ways, all equivalent; these all mean the same thing: 


Qa1,92,Q43,..-.- 
{G@n}na1 
{f(") fnaa 


As with functions on the real numbers, we will most often encounter sequences that 
can be expressed by a formula. We have already seen the sequence a; = f(i) = 1 —1/2', 
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and others are easy to come by: 


ae Ess 

f(n) = 55 

f(n) = sin(n7/6) 
fl) = (i — a + 2) 


Frequently these formulas will make sense if thought of either as functions with domain R 
or N, though occasionally one will make sense only for integer values. 
Faced with a sequence we are interested in the limit 


lim: 7G) linia; 
tCo 1 0O 


We already understand 
lina: (ae) 


LOO 


when 2 is a real valued variable; now we simply want to restrict the “input” values to be 
integers. No real difference is required in the definition of limit, except that we specify, per- 
haps implicitly, that the variable is an integer. Compare this definition to definition 4.10.4. 


DEFINITION 11.1.1 Suppose that {a,,}°°_, is a sequence. We say that lim a, = L 
noo 


if for every € > 0 there is an N > 0 so that whenever n > N, la, — L| <¢. If lim a, = L 
n> oo 


we say that the sequence converges, otherwise it diverges. 


If f(z) defines a sequence, and f(x) makes sense, and lim f(x) = L, then it is clear 
xL—->CO 
that lim f(z) = L as well, but it is important to note that the converse of this statement 
1 CO 
is not true. For example, since lim (1/x) = 0, it is clear that also lim (1/7) = 0, that is, 
xL—->> CO i oe ae, @) 


the numbers 
111i1di1£<édi 


V2" 3°4°5 6°" 


get closer and closer to 0. Consider this, however: Let f(n) = sin(n7z). This is the sequence 
sin(O7), sin(17), sin(27), sin(37),... = 0,0,0,0,... 


since sin(n7) = 0 when n is an integer. Thus lim f(n) = 0. But lim f(x), when z is 


real, does not exist: as x gets bigger and bigger, the values sin(x7) do not get closer and 
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closer to a single value, but take on all values between —1 and 1 over and over. In general, 
whenever you want to know lim f(n) you should first attempt to compute lim f(z), 
noo wL->0CO 


since if the latter exists it is also equal to the first limit. But if for some reason lim f(z) 
wL—->0o 
does not exist, it may still be true that lim f(n) exists, but you’ll have to figure out 
n—- oo 


another way to compute it. 

It is occasionally useful to think of the graph of a sequence. Since the function is 
defined only for integer values, the graph is just a sequence of dots. In figure 11.1.1 we see 
the graphs of two sequences and the graphs of the corresponding real functions. 


5 4 59 
4 4 4 4| 
34 34 
=] 
37 ln) =1/m | 
1 je . 1; 
0 Le hd kd e e e e ° 0 
0 5 10 
x) = sin(aar 
; ; f(z) (x7) 
f(n) = sin(nz) 
0) ¢—_+_* ¢—+ ¢—_+—_-* @) 
12 3 4 5 6 7 8 
—1 —l 


Figure 11.1.1 Graphs of sequences and their corresponding real functions. 


Not surprisingly, the properties of limits of real functions translate into properties of 
sequences quite easily. Theorem 2.3.6 about limits becomes 


THEOREM 11.1.2 Suppose that lim a, = Land lim b, = M and k is some constant. 
noo n> oo 


Then 
lim ka, =k lim an = kL 
noo i mae.) 


lim (a, +b,) = lim a, + lim b, =L+M 
n—0o noo n—0oo 


lim (a, — b,) = lim a, — lim b, =L—M 
n— oo n— oo n—0o 
lit (G.b_) = Nine gi, lime = 
noo n— oo noo 
Qn — liMpsodn OL , 

= = f M t 0 
ob,  fiieaoDn Me 


Likewise the Squeeze Theorem (4.3.1) becomes 
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THEOREM 11.1.3 Suppose that a, <b, <c, for alln > N, for some N. If lim a, = 


N—->Co 


lim c, = L, then lim by, = L. | 
n—- co N—-+ oo 

And a final useful fact: 
THEOREM 11.1.4 lim |a,,| =0 if and only if lim a, =0. a 

noo n—-0o 

This says simply that the size of a, gets close to zero if and only if a, gets close to 
Zero. 
EXAMPLE 11.1.5 Determine whether tzer} converges or diverges. If it con- 

n n=0 
verges, compute the limit. Since this makes sense for real numbers we consider 
1 
lim —— = lim 1- =1-0=1. 
Thus the sequence converges to 1. 
l CO 
EXAMPLE 11.1.6 Determine whether aah converges or diverges. If it con- 
n 
n=1 
verges, compute the limit. We compute 
] il 
lim —~ = lim ae = 
@—oo too =] 


using L’H6pital’s Rule. Thus the sequence converges to 0. 


EXAMPLE 11.1.7 Determine whether {(—1)”"}°°.) converges or diverges. If it con- 
verges, compute the limit. This does not make sense for all real exponents, but the sequence 
is easy to understand: it is 


| me ay ees ee 


and clearly diverges. 


EXAMPLE 11.1.8 Determine whether {(—1/2)"}°°.) converges or diverges. If it con- 
verges, compute the limit. We consider the sequence {|(—1/2)”|}?2.5) = {(1/2)"}72.5. Then 


so by theorem 11.1.4 the sequence converges to 0. 
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EXAMPLE 11.1.9 Determine whether {(sinn)//n}°2, converges or diverges. If it 
converges, compute the limit. Since |sinn| < 1, 0 < |sinn/V/n| < 1/\/n and we can use 


theorem 11.1.3 with a, = 0 and c, =1//n. Since lim a, = lim c, = 0, lim sinn/./n = 
N— Ooo Noo noo 


O and the sequence converges to 0. 


EXAMPLE 11.1.10 A particularly common and useful sequence is {r”}?°_9, for various 
values of r. Some are quite easy to understand: If r = 1 the sequence converges to 1 since 
every term is 1, and likewise if r = 0 the sequence converges to 0. If r = —1 this is 
the sequence of example 11.1.7 and diverges. If r > 1 or r < —1 the terms r” get large 
without limit, so the sequence diverges. If 0 < r < 1 then the sequence converges to 0. 
If —1 < r < 0 then |r”| = |r|” and 0 < |r| < 1, so the sequence {|r|"}72_, converges to 
0, so also {r”}°°_) converges to 0. converges. In summary, {r”} converges precisely when 
—1<r<1 in which case 


lim r” = 
n—-cCo 


i tlre 
1 wpe 


Sometimes we will not be able to determine the limit of a sequence, but we still would 
like to know whether it converges. In some cases we can determine this even without being 
able to compute the limit. 

A sequence is called increasing or sometimes strictly increasing if a; < aj;+1 for 
all 7. It is called non-decreasing or sometimes (unfortunately) increasing if a; < aj+1 
for all 7. Similarly a sequence is decreasing if a; > a;,, for all 2 and non-increasing if 
a; > ai+1 for all 7. If a sequence has any of these properties it is called monotonic. 


EXAMPLE 11.1.11 The sequence 


2-1)" 137 15 
2 | PAS iG 


is increasing, and 


is decreasing. 


A sequence is bounded above if there is some number N such that a, < N for every 
n, and bounded below if there is some number N such that a, > N for every n. If a 
sequence is bounded above and bounded below it is bounded. If a sequence {a,,}?-.9 is 
increasing or non-decreasing it is bounded below (by ao), and if it is decreasing or non- 
increasing it is bounded above (by ag). Finally, with all this new terminology we can state 


an important theorem. 
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THEOREM 11.1.12 If a sequence is bounded and monotonic then it converges. a 


We will not prove this; the proof appears in many calculus books. It is not hard to 
believe: suppose that a sequence is increasing and bounded, so each term is larger than the 
one before, yet never larger than some fixed value N. The terms must then get closer and 
closer to some value between ag and N. It need not be N, since N may be a “too-generous” 
upper bound; the limit will be the smallest number that is above all of the terms a,. 


EXAMPLE 11.1.13 All of the terms (2’ — 1)/2’ are less than 2, and the sequence is 
increasing. As we have seen, the limit of the sequence is 1—1 is the smallest number that 
is bigger than all the terms in the sequence. Similarly, all of the terms (n+ 1)/n are bigger 
than 1/2, and the limit is 1—1 is the largest number that is smaller than the terms of the 


sequence. 


We don’t actually need to know that a sequence is monotonic to apply this theorem— 
it is enough to know that the sequence is “eventually” monotonic, that is, that at some 
point it becomes increasing or decreasing. For example, the sequence 10, 9, 8, 15, 3, 21, 4, 
3/4, 7/8, 15/16, 31/32,... is not increasing, because among the first few terms it is not. 
But starting with the term 3/4 it is increasing, so the theorem tells us that the sequence 
3/4, 7/8, 15/16, 31/32, ... converges. Since convergence depends only on what happens as 
n gets large, adding a few terms at the beginning can’t turn a convergent sequence into a 


divergent one. 


EXAMPLE 11.1.14 Show that {n!/"} converges. 


We first show that this sequence is decreasing, that is, that ns (n+ 1)1/ (+1) Consider 


1/= when « > 1. We can compute the derivative, f’(2) = 


the real function f(z) = x 
x'/*(1—Inx)/x?, and note that when x > 3 this is negative. Since the function has negative 
slope, > (n+ ery when n > 3. Since all terms of the sequence are positive, the 
sequence is decreasing and bounded when n > 3, and so the sequence converges. (As it 


happens, we can compute the limit in this case, but we know it converges even without 


knowing the limit; see exercise 1.) 


EXAMPLE 11.1.15 Show that {n!/n"} converges. 


Again we show that the sequence is decreasing, and since each term is positive the sequence 
converges. We can’t take the derivative this time, as x! doesn’t make sense for x real. But 
we note that if a@,41/a, <1 then a,41 < an, which is what we want to know. So we look 


at diya) On: 


Qn41 (n+1)! nn” (n+1)! ag aaal n ) =( n ya 


ig (nt 1)! nl n! (n+1™41~ n+1 n+1 
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(Again it is possible to compute the limit; see exercise 2.) 


Exercises 11.1. 


1. Compute lim 2'/*. > 
@w—- CO 


! 
2. Use the squeeze theorem to show that lim ene 
noo Nn” 


3. Determine whether {Vn + 47 — \/n}°°_o converges or diverges. If it converges, compute the 


limit. > 
me oa ae 
4. Determine whether ica} converges or diverges. If it converges, compute the limit. 
n 
n=0 
=> 
5. Determine whether { neat converges or diverges. If it converges, compute the 
: Ww —_—. : 
Vn? + 3n) nai 


limit. > 
Qr)\°~ 
6. Determine whether {a} converges or diverges. > 


n=0 


11.2 SERIES 


While much more can be said about sequences, we now turn to our principal interest, 
series. Recall that a series, roughly speaking, is the sum of a sequence: if {a,}?P29 is a 
sequence then the associated series is 


So an = a9 +41 +42 +°°- 
i=0 


Associated with a series is a second sequence, called the sequence of partial sums 


co. 
{Sit =o: 
nm 
Sn = ) aj. 
i=0 
So 
$9=@, $1 =A 141, S82=a91014+ a2, 


A series converges if the sequence of partial sums converges, and otherwise the series 
diverges. 


EXAMPLE 11.2.1 Ifa, = kx”, ys dy, is called a geometric series. A typical partial 
n=0 
sum is 


Sn =kt+katke* +kett+---tke™ =k teta?+a3+--- +2"), 
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We note that 


s,(l—a2) =k +at+a?4+a°4+---+2")(1-2) 
=k(lt+ata?+a°4+---+e")l-kl tata? tae t--- +a" t+ 2")¢ 
=k(ltata?taet. +a" -a2—-2? -g3 —...-g™—-2""") 
= k(1—a"t") 
SO 


1 — grtl 
n= k 
7 1-2 
If lz) <1, dim 2? = 0 se 
noo 
l=g"" | 

lim s, = lim k . = ? 
n—0o noo —27 1-2 


Thus, when |x| < 1 the geometric series converges to k/(1—a). When, for example, k = 1 
and ~ = 1/2: 


1—(1/2)"*? arth if 
5 1-1/2 Qn gn aN = B= 


We began the chapter with the series 
“1 
Dee 
n=1 
namely, the geometric series without the first term 1. Each partial sum of this series is 1 


less than the corresponding partial sum for the geometric series, so of course the limit is 
also one less than the value of the geometric series, that is, 


a | 
dao} 


It is not hard to see that the following theorem follows from theorem 11.1.2. 


THEOREM 11.2.2 Suppose that 5>a, and }>b, are convergent series, and c is a 
constant. Then 


1. y Cady, is convergent and y COn =C y An 
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2. Sn + b,) is convergent and Gx + bn) = > Qn + Dy Bi 


The two parts of this theorem are subtly different. Suppose that 5° a, diverges; does 
> cay, also diverge if c is non-zero? Yes: suppose instead that 45> ca, converges; then by 
the theorem, 5>(1/c)ca, converges, but this is the same as }>a,, which by assumption 
diverges. Hence )*ca,, also diverges. Note that we are applying the theorem with a, 
replaced by ca, and c replaced by (1/c). 

Now suppose that 5>a, and >°b, diverge; does }>(a, + b,) also diverge? Now the 
answer is no: Let a, = 1 and b,, = —1, so certainly }>a, and )>b, diverge. But )>(an + 
bn) = 9511+ -1) = 550 = 0. Of course, sometimes 5 \(a, + 5,,) will also diverge, for 
example, if a, = b, = 1, then })(a@n + bn) = D0(1 +1) = 552 diverges. 

In general, the sequence of partial sums s,, is harder to understand and analyze than 
the sequence of terms a, and it is difficult to determine whether series converge and if so 
to what. Sometimes things are relatively simple, starting with the following. 


THEOREM 11.2.3 If >>a, converges then lim a, = 0. 
oo 


n 
Proof. Since }>a,, converges, lim s, = L and lim s,_1 = L, because this really says 
noo noo 


the same thing but “renumbers” the terms. By theorem 11.1.2, 
lim (S, — Sn-1) = lim s, — lim s,_) = D—L=0. 
nN oco nN co nN oco 
But 
Sn — 8n—-1 = (@9 +41 + a2 +--+ +n) — (29 +1 +024 ---+Gn_-1) = Gn, 


so as desired lim a, = 0. | 
noo 


This theorem presents an easy divergence test: if given a series }* a, the limit lim a, 
Noo 


does not exist or has a value other than zero, the series diverges. Note well that the 


converse is not true: If lim a, = 0 then the series does not necessarily converge. 
Noo 


EXAMPLE 11.2.4 Show that 5 a diverges. 
n 
n=1 


We compute the limit: 
n 


=1#0. 


Looking at the first few terms perhaps makes it clear that the series has no chance of 


lim 
nao n+l 


converging: 


paar ara 
2°3°4°5 
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will just get larger and larger; indeed, after a bit longer the series starts to look very much 
like ---+1+1+1+14---, and of course if we add up enough 1’s we can make the sum 


as large as we desire. 


1 
EXAMPLE 11.2.5 Show that y — diverges. 
n 
n=1 


Here the theorem does not apply: lim 1/n = 0, so it looks like perhaps the series con- 
N+ Cco 


verges. Indeed, if you have the fortitude (or the software) to add up the first 1000 terms 


you will find that 
1000 


So ow 7.49, 


n=1 
so it might be reasonable to speculate that the series converges to something in the neigh- 


borhood of 10. But in fact the partial sums do go to infinity; they just get big very, very 
slowly. Consider the following: 


(ee ee ar on ee ae oe ee ee 
arg a 2°4°4 2° 2 

eee ee ee 
2°3'4'°5'6'7°8 2°4'°4°8°8°8 8 a ae 


eee ee eee eR OR ae ee ee ee eee ee ee 
os 16 2°4°4'8 8 ' 16 a a ae ae 


and so on. By swallowing up more and more terms we can always manage to add at least 


another 1/2 to the sum, and by adding enough of these we can make the partial sums as 
big as we like. In fact, it’s not hard to see from this pattern that 
1+ : + : a at : >1+ = 
2 3 ay 2? 
so to make sure the sum is over 100, for example, we’d add up terms until we get to 
around 1/2'*, that is, about 4-10°° terms. This series, )*(1/n), is called the harmonic 
series. 


Exercises 11.2. 


oo 2 

1. Explain why > al diverges. > 
5B , 

2. Explain why S Rie} 14 diverges. > 


nm 


3. Explain why SS - diverges. > 


n= 
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4 
4. Compute ae, = 


n=0 


11.3 THE INTEGRAL TEST 


It is generally quite difficult, often impossible, to determine the value of a series exactly. 
In many cases it is possible at least to determine whether or not the series converges, and 
so we will spend most of our time on this problem. 

If all of the terms a,, in a series are non-negative, then clearly the sequence of partial 
sums Ss, is non-decreasing. This means that if we can show that the sequence of partial 
sums is bounded, the series must converge. We know that if the series converges, the 
terms a, approach zero, but this does not mean that an > an+41 for every n. Many useful 
and interesting series do have this property, however, and they are among the easiest to 
understand. Let’s look at an example. 


1 
EXAMPLE 11.3.1 Show that y —, converges. 
n 
n=1 


The terms 1/n? are positive and decreasing, and since lim 1/x? = 0, the terms 1/n? 
LOO 


approach zero. We seek an upper bound for all the partial sums, that is, we want to 
find a number N so that s, < N for every n. The upper bound is provided courtesy of 
integration, and is inherent in figure 11.3.1. 

The figure shows the graph of y = 1/x? together with some rectangles that lie com- 
pletely below the curve and that all have base length one. Because the heights of the 
rectangles are determined by the height of the curve, the areas of the rectangles are 1/17, 
1/27, 1/37, and so on—in other words, exactly the terms of the series. The partial sum 
Sy is simply the sum of the areas of the first n rectangles. Because the rectangles all lie 
between the curve and the x-axis, any sum of rectangle areas is less than the corresponding 
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Figure 11.3.1 Graph of y = 1/2? with rectangles. 


area under the curve, and so of course any sum of rectangle areas is less than the area 
under the entire curve, that is, all the way to infinity. There is a bit of trouble at the left 
end, where there is an asymptote, but we can work around that easily. Here it is: 


1 1 1 1 ame | a 
recalling that we computed this improper integral in section 9.7. Since the sequence of 
partial sums s,, is increasing and bounded above by 2, we know that lim s, = L < 2, and 
Noo 


so the series converges to some number less than 2. In fact, it is possible, though difficult, 
to show that L = 17/6 = 1.6. 


We already know that 5>1/n diverges. What goes wrong if we try to apply this 
technique to it? Here’s the calculation: 


ti .d a i me ag 
maptotgtet iit | ade <i — dx = 1+ oo. 
1 2 3 n 1 wv 1 & 


The problem is that the improper integral doesn’t converge. Note well that this does 
not prove that 5*1/n diverges, just that this particular calculation fails to prove that it 
converges. A slight modification, however, allows us to prove in a second way that 5>1/n 
diverges. 


EXAMPLE 11.3.2 Consider a slightly altered version of figure 11.3.1, shown in fig- 
ure 11.3.2. 

The rectangles this time are above the curve, that is, each rectangle completely contains 
the corresponding area under the curve. This means that 


a 1 ie n+l 
maptytgte toe | —dz=I|nz = In(n + 1). 
1 2 3 n 1 x 1 


As n gets bigger, In(n + 1) goes to infinity, so the sequence of partial sums s,, must also 


go to infinity, so the harmonic series diverges. 
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0 1 2 3 4 5 


Figure 11.3.2 Graph of y = 1/2 with rectangles. 


The important fact that clinches this example is that 


n+l 


lim dt = 00; 
noo fy v 


eT, 
is —dz=o. 
1 x 


So these two examples taken together indicate that we can prove that a series converges 


which we can rewrite as 


or prove that it diverges with a single calculation of an improper integral. This is known 
as the integral test, which we state as a theorem. 


THEOREM 11.3.3 Suppose that f(x) > 0 and is decreasing on the infinite interval 


[k, co) (for some & > 1) and that a, = f(n). Then the series S- Gy converges if and only 
n=1 
if the improper integral | f(x) dx converges. rT 
1 


The two examples we have seen are called p-series; a p-series is any series of the form 
S- 1/n”. If p< 0, lim 1/n? ¥ 0, so the series diverges. For positive values of p we can 
noo 


determine precisely which series converge. 


THEOREM 11.3.4 A p-series with p > 0 converges if and only if p > 1. 


Proof. We use the integral test; we have already done p = 1, so assume that p # 1. 
D 


~ 4 : gi-P 2 > Pek 1 
—dzx= lim = lim _ : 
T geP Doo Lp y D>+co 1—p 1-—p 


If p > 1 then 1 — p < 0 and Zim D‘~” = 0, so the integral converges. If 0 < p < 1 then 
—0o 


1—p>0Oand zim D'~? = x, so the integral diverges. rT] 
—0o 
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“1 
EXAMPLE 11.3.5 Show that y — converges. 
n 
n=1 


We could of course use the integral test, but now that we have the theorem we may simply 


note that this is a p-series with p > 1. 


(oe) 


EXAMPLE 11.3.6 Show that > = converges. 
n 


We know that if e 1/n* converges then S- 5/n* also converges, by theorem 11.2.2. Since 


n=1 n=1 


CO CO 
y 1/ n* is a convergent p-series, y 5/ n* converges also. 


n=1 n=1 
EXAMPLE 11.3.7 Show that a a diverges. 


— 1 
This also follows from theorem 11.2.2: Since Ss" Ti is a p-series with p = 1/2 < 1, it 
n 


diverges, and so does ee 
4 /n 


Since it is typically difficult to compute the value of a series exactly, a good approx- 
imation is frequently required. In a real sense, a good approximation is only as good as 
we know it is, that is, while an approximation may in fact be good, it is only valuable in 
practice if we can guarantee its accuracy to some degree. This guarantee is usually easy 
to come by for series with decreasing positive terms. 


EXAMPLE 11.3.8 Approximate Ss" 1/n? to two decimal places. 
N 
Referring to figure 11.3.1, if we approximate the sum by S- 1/n, the error we make is 
n=1 
the total area of the remaining rectangles, all of which lie under the curve 1/2? from « = N 
out to infinity. So we know the true value of the series is larger than the approximation, 
and no bigger than the approximation plus the area under the curve from N to infinity. 
Roughly, then, we need to find N so that 


coe | 
— dx < 1/100. 
Meee 


270 Chapter 11 Sequences and Series 


canal 1 
ee ewes 
‘ x2 oN’ 


so N = 100 is a good starting point. Adding up the first 100 terms gives approximately 
1.634983900, and that plus 1/100 is 1.644983900, so approximating the series by the value 
halfway between these will be at most 1/200 = 0.005 in error. The midpoint is 1.639983900, 
but while this is correct to +0.005, we can’t tell if the correct two-decimal approximation 


We can compute the integral: 


is 1.63 or 1.64. We need to make N big enough to reduce the guaranteed error, perhaps to 
around 0.004 to be safe, so we would need 1/N = 0.008, or N = 125. Now the sum of the 
first 125 terms is approximately 1.636965982, and that plus 0.008 is 1.644965982 and the 
point halfway between them is 1.640965982. The true value is then 1.640965982+0.004, and 
all numbers in this range round to 1.64, so 1.64 is correct to two decimal places. We have 
mentioned that the true value of this series can be shown to be 17/6 © 1.644934068 which 
rounds down to 1.64 (just barely) and is indeed below the upper bound of 1.644965982, 
again just barely. Frequently approximations will be even better than the “guaranteed” 


accuracy, but not always, as this example demonstrates. 


Exercises 11.3. 


Determine whether each series converges or diverges. 


= 4 = n 
Do aa > i aera 
Inn = 1 

3 a 4. S> aaa 
n=1 n=1 
me! “an 

5. a> 6. >> at 
n=1 n=1 
ae _ 1 

7. => 8. => 
2, ninn <~ n(Inn)? 


love) N N 
1 1 1 
9. Find an WN so that y 74 is between y aa and y A +0.005. => 
ae 


n=1 n=1 


love) N N 
1 1 1 - 
10. Find an N so that y ae is between y a and y o +10*.5 


n=0 n=0 n=0 
Find N h Se M7 2 b = Inn A = Inn 
11. Find an so that a ae is between 2, a an Z Te +0.005. > 


fore) N N 
. 1 . 1 
12. Find an N sO that ) n(inn)2 1s between ) n(inn)2 and ) a + 0.005. => 
n=2 n=2 


n=2 
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11.4 ALTERNATING SERIES 


Next we consider series with both positive and negative terms, but in a regular pattern: 
they alternate, as in the alternating harmonic series for example: 


yale a a | re re | 
= n 3 


In this series the sizes of the terms decrease, that is, |a,| forms a decreasing sequence, 
but this is not required in an alternating series. As with positive term series, however, 
when the terms do have decreasing sizes it is easier to analyze the series, much easier, in 
fact, than positive term series. Consider pictorially what is going on in the alternating 
harmonic series, shown in figure 11.4.1. Because the sizes of the terms a,, are decreasing, 
the partial sums s1;, 53, §5;, and so on, form a decreasing sequence that is bounded below 
by s2, so this sequence must converge. Likewise, the partial sums s2, 54, sg, and so on, 
form an increasing sequence that is bounded above by s1, so this sequence also converges. 
Since all the even numbered partial sums are less than all the odd numbered ones, and 
since the “jumps” (that is, the a; terms) are getting smaller and smaller, the two sequences 
must converge to the same value, meaning the entire sequence of partial sums $1, 52, 83,... 


converges as well. 


1 

a= 73 

a4 

a6 
ees! S4 S6 Ss | 83 | 
a 2 l=s,;=a, 
4 a5 

a3 


Figure 11.4.1 The alternating harmonic series. 


There’s nothing special about the alternating harmonic series—the same argument 
works for any alternating sequence with decreasing size terms. The alternating series test 
is worth calling a theorem. 


THEOREM 11.4.1 Suppose that {a,}°2, is a non-increasing sequence of positive 


Co 


numbers and lim a, = 0. Then the alternating series S$ >(-1) 


n—->oco 


nla, converges. 


n=1 
Proof. The odd numbered partial sums, 51, 53, 55, and so on, form a non-increasing 
sequence, because 52443 = Sak+1 — Gak+2 + Gen+3 < S2n41, Since Go~42 > Gox43. This 
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sequence is bounded below by sz, so it must converge, say jim Sort41 = L. Likewise, 
Ooo 


the partial sums s2, 54, sg, and so on, form a non-decreasing sequence that is bounded 


above by 81, so this sequence also converges, say lim so, = M. Since lim a, = 0 and 
k—- oo n— oo 


SQk+1 = S2k + G2K4+1, 
C= lim sop41 = lim (S04 + Gap41) = lim sox + lim agp4, =M+0=M, 
k->00 k->o00o k- 00 k->00 


so [ = M, the two sequences of partial sums converge to the same limit, and this means 
the entire sequence of partial sums also converges to L. a 


Another useful fact is implicit in this discussion. Suppose that 


BS (ay 
n=1 
and that we approximate L by a finite part of this sum, say 
N 
Les S>(-1)" "an. 
n=1 


Because the terms are decreasing in size, we know that the true value of L must be between 
this approximation and the next one, that is, between 


N N+1 
Ss >(-1)” la, and S° (=1)""*a,, 
n=1 n=1 


Depending on whether N is odd or even, the second will be larger or smaller than the first. 


EXAMPLE 11.4.2 Approximate the alternating harmonic series to one decimal place. 

We need to go roughly to the point at which the next term to be added or subtracted 
is 1/10. Adding up the first nine and the first ten terms we get approximately 0.746 and 
0.646. These are 1/10 apart, but it is not clear how the correct value would be rounded. It 
turns out that we are able to settle the question by computing the sums of the first eleven 
and twelve terms, which give 0.737 and 0.653, so correct to one place the value is 0.7. 


We have considered alternating series with first index 1, and in which the first term is 


positive, but a little thought shows this is not crucial. The same test applies to any similar 


series, such as S"(-1)"an, S°(=1)"4n, S- (—1)"an, ete. 


n=0 n=1 n=17 
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Exercises 11.4. 


Determine whether the following series converge or diverge. 


= an+5 7 & Vn=3 

= n—-1 n — a4 Le 
yt = 4 oa 

n=1 n=1 


— 1 
2 n—l1 a 
5. Approximate y (—1) 73 to two decimal places. => 


n=1 


. = n-1 1 . 
6. Approximate S>(-1) a to two decimal places. => 


n=1 


11.5 COMPARISON TESTS 


As we begin to compile a list of convergent and divergent series, new ones can sometimes 


be analyzed by comparing them to ones that we already understand. 


EXAMPLE 11.5.1 Does y — converge? 
n7 Inn 
n=2 


The obvious first approach, based on what we know, is the integral test. Unfortunately, 
we can’t compute the required antiderivative. But looking at the series, it would appear 
that it must converge, because the terms we are adding are smaller than the terms of a 


p-series, that is, 
1 1 


— < ae 
n2Inn ~~ n? 
when n > 3. Since adding up the terms 1/n? doesn’t get “too big”, the new series “should” 


also converge. Let’s make this more precise. 


CO CO 
The series y ——— converges if and only if y —— converges—all we’ve done is 
2] 2] 
4n?lnn 4m? Inn 
n= n= 


1 
dropped the initial term. We know that y —; converges. Looking at two typical partial 
n 
n=3 


sums: 


ee ee ee ee ee ae 
~ 32In3. 421In4.—sO52 In 5 n?7Inn 32 42 52 ne” 


Sn 


Since the p-series converges, say to L, and since the terms are positive, t, < L. Since the 
terms of the new series are positive, the s,, form an increasing sequence and s, < tn < L 


for all n. Hence the sequence {s,,} is bounded and so converges. 
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Sometimes, even when the integral test applies, comparison to a known series is easier, 
so it’s generally a good idea to think about doing a comparison before doing the integral 
test. 


| sin n| 
5 converge? 


EXAMPLE 11.5.2 Does S- 


n 
n=1 


We can’t apply the integral test here, because the terms of this series are not decreasing. 
Just as in the previous example, however, 


| sin n| 1 


because |sinn| < 1. Once again the partial sums are non-decreasing and bounded above 


by so 1/n? = L, so the new series converges. 


Like the integral test, the comparison test can be used to show both convergence and 
divergence. In the case of the integral test, a single calculation will confirm whichever is 
the case. To use the comparison test we must first have a good idea as to convergence or 
divergence and pick the sequence for comparison accordingly. 


EXAMPLE 11.5.3. Does S° 


n=2 


1 
———— converge? 
Vn? —3 

We observe that the —3 should have little effect compared to the n? inside the square 
root, and therefore guess that the terms are enough like 1/Vn? = 1/n that the series 
should diverge. We attempt to show this by comparison to the harmonic series. We note 
that 


1 1 1 
= ——e 
Jn? — 3 Vn2 nm 
so that 
pikes ae : es : aa dic ey 
cane eae ary eer nm—3 2 8 un 


where t, is 1 less than the corresponding partial sum of the harmonic series (because we 


start at n = 2 instead of n= 1). Since lim t, = oo, lim s,, = 00 as well. 
Noo noo 


So the general approach is this: If you believe that a new series is convergent, attempt 
to find a convergent series whose terms are larger than the terms of the new series; if you 
believe that a new series is divergent, attempt to find a divergent series whose terms are 
smaller than the terms of the new series. 


11.5 Comparison Tests 275 


converge? 


= i 
EXAMPLE 11.5.4 Does —— 
d Vn? +3 
Just as in the last example, we guess that this is very much like the harmonic series 


and so diverges. Unfortunately, 
1 


1 
—— < 5 
ne+3 on 


so we can’t compare the series directly to the harmonic series. A little thought leads us to 


i] 1 1 


> a ’ 
Vn2+3°  WVn?+3n? = 2n 


so if }>1/(2n) diverges then the given series diverges. But since $5 1/(2n) = (1/2) $3 1/n, 
theorem 11.2.2 implies that it does indeed diverge. 


For reference we summarize the comparison test in a theorem. 


THEOREM 11.5.5 Suppose that a, and b, are non-negative for all n and that an < by 
when n > N, for some N. 


If S- b, converges, so does > An: 


n=0 n=0 
CoO CO 

If SS Gn diverges, so does » bn. 
n=0 n=0 


Exercises 11.5. 


Determine whether the series converge or diverge. 


= 1 _ 1 
Iz —__—_ > 2s SS 
22 Int pan FB 24 Int ton 8 
oo i = 3n+ 4 
4. 
2 26 i=in a8 26 int + 3n 45 
Bn? +4 “Inn 
5 6. ——— 
2s mt + 3n 45 a n 
om Se 
3 : Inn 
n=1 n=2 
9 => 10. = 
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11.6 ABSOLUTE CONVERGENCE 


Roughly speaking there are two ways for a series to converge: As in the case of $*1/n?, 
the individual terms get small very quickly, so that the sum of all of them stays finite, or, 
as in the case of So(-1)""1/n, the terms don’t get small fast enough (> 1/n diverges), 
but a mixture of positive and negative terms provides enough cancellation to keep the 
sum finite. You might guess from what we’ve seen that if the terms get small fast enough 
to do the job, then whether or not some terms are negative and some positive the series 


converges. 


THEOREM 11.6.1 If Ss |a,| converges, then S- Gn, converges. 


n=0 n=0 


Proof. Note that 0 < a,+]an| < 2/a,| so by the comparison test S/(antlan|) converges. 


n=0 
Now 
PG + |@n|) — » lan| = yah + |an| — |an| = yi 
n=0 n=0 n=0 n=0 
converges by theorem 11.2.2. rT] 


So given a series 4>a,, with both positive and negative terms, you should first ask 
whether 5° |a,| converges. This may be an easier question to answer, because we have 
tests that apply specifically to series with non-negative terms. If 5>|a,,| converges then 
you know that 5*a, converges as well. If }*|a,| diverges then it still may be true that 
> dn converges—you will have to do more work to decide the question. Another way to 
think of this result is: it is (potentially) easier for }*> a, to converge than for 5° |a,| to 
converge, because the latter series cannot take advantage of cancellation. 

If 5° |a,,| converges we say that 5° a, converges absolutely; to say that 5° a,, converges 
absolutely is to say that any cancellation that happens to come along is not really needed, 
as the terms already get small so fast that convergence is guaranteed by that alone. If 
> an converges but > |a,| does not, we say that 5>a, converges conditionally. For 


— i — i 
example So(-)" 1S converges absolutely, while So(-1)""*= converges conditionally. 
n n 


n=l n=1 


sinn 
5 converge? 


EXAMPLE 11.6.2 Does S- 


n=2 


n 
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In example 11.5.2 we saw that Ea converges, so the given series converges abso- 
n 
n=2 
lutely. 
— 3n+4 
ajye_ ts vd 
EXAMPLE 11.6.3 Does 2 1) med Bn pe Converse’ 


a Bn +4 eS 
Taking the absolute value, S- 503 Dane diverges by comparison to > —., so if the 
par! 2n*+3n+5 ar 10n 


series converges it does so conditionally. It is true that lim (3n + 4)/(2n? + 3n +5) = 0, 
n—- oo 


so to apply the alternating series test we need to know whether the terms are decreasing. 
If we let f(x) = (8a + 4)/(2a? + 3245) then f’(x) = —(6x? + 16a — 3)/(2Qx? + 3a + 5)?, 
and it is not hard to see that this is negative for x > 1, so the series is decreasing and by 


the alternating series test it converges. 


Exercises 11.6. 


Determine whether each series converges absolutely, converges conditionally, or diverges. 


. - 1 = = 3n? +4 
1. i ae => 2. es 
mm ) Qn? + 8n+5 2 ) 2n? + 3n+5 
_ n_-1 Inn _ n-1 Inn 
3. S0(-1)" '— = 4. S-(-1) e 
n=1 n=1 
.. Sey = 6 Ser 
= Inn = 2” + 6” 
— gS - n—1 arctann 
7. > (-1) Ta Mad 8. > (-1) —< 
n=0 n=1 


11.7 THE RATIO AND Root TESTS 


CO 5 
n : 
Does the series >. Bn converge? It is possible, but a bit unpleasant, to approach this 


n=0 
with the integral test or the comparison test, but there is an easier way. Consider what 


happens as we move from one term to the next in this series: 


n> (n+1)° 
5r 5nt+1 


The denominator goes up by a factor of 5, 5"*! = 5-5", but the numerator goes up by 


much less: (n+ 1)? = n° +5n* + 10n? + 10n? + 5n +1, which is much less than 5n° when 


5 


n is large, because 5n* is much less than n°. So we might guess that in the long run it 
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begins to look as if each term is 1/5 of the previous term. We have seen series that behave 


like this: 
Sas 
mr 5r OA 


a geometric series. So we might try comparing the given series to some variation of this 
geometric series. This is possible, but a bit messy. We can in effect do the same thing, 
but bypass most of the unpleasant work. 

The key is to notice that 


This is really just what we noticed above, done a bit more officially: in the long run, each 
term is one fifth of the previous term. Now pick some number between 1/5 and 1, say 1/2. 
Because 


lim eT : 


n>0CO An, 5’ 


then when n is big enough, say n > N for some N, 


a 1 a 
SEE gee: and Anti <—. 
An 2 2 


So ani1 < Gn /2, Gni2 < an41/2 < an/4, ang3 < Gn42/2 < an4i/4 < an/8, and so 
on. The general form is any, < ay /2*. So if we look at the series 


CO 


Sante =@n +4n41 + On42+Gn43 45°: +aN¢R++°° 
k=0 


its terms are less than or equal to the terms of the sequence 


an | aN , an an — aN 
a eee — see —_ = 2 . 
Gg og ge Do oe = 2an 
k=0 
So by the comparison test, YS" ane converges, and this means that S- Gn converges, 
k=0 n=0 


since we’ve just added the fixed number ap + a, +---+ay_1. 

Under what circumstances could we do this? What was crucial was that the limit of 
An+41/Qn, say L, was less than 1 so that we could pick a value r so that L <r < 1. The 
fact that L < r (1/5 < 1/2 in our example) means that we can compare the series }> an 
to Sor”, and the fact that r < 1 guarantees that }>r” converges. That’s really all that is 
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required to make the argument work. We also made use of the fact that the terms of the 
series were positive; in general we simply consider the absolute values of the terms and we 
end up testing for absolute convergence. 


THEOREM 11.7.1 The Ratio Test Suppose that lim ja,41/a,| = Ll. If Db <1 
n—- oo 


the series }* a, converges absolutely, if L > 1 the series diverges, and if L = 1 this test 


gives no information. 


Proof. The example above essentially proves the first part of this, if we simply replace 
1/5 by LZ and 1/2 by r. Suppose that Z > 1, and pick r so that 1 < r < L. Then for 
n> N, for some N, 


>r and |an4i| > rian. 
lan| 


This implies that jJav+.| > r*lan|, but since r > 1 this means that jim lan+z| #0, which 
oo 
means also that lim a, #0. By the divergence test, the series diverges. 
noo 


To see that we get no information when L = 1, we need to exhibit two series with 
L = 1, one that converges and one that diverges. It is easy to see that S>1/n? and S*1/n 
do the job. rT 


EXAMPLE 11.7.2 The ratio test is particularly useful for series involving the factorial 


function. Consider S- 5” /n!. 


n=0 


5rtl nl _ gett al 


Fire eee 7 ee 
noo (N+ 115" nto 5” (n+1)! noo (n+1) 


Since 0 < 1, the series converges. 


A similar argument, which we will not do, justifies a similar test that is occasionally 
easier to apply. 


THEOREM 11.7.3. The Root Test Suppose that lim |a,|!/" = L. If L < 1 the 
n—- oo 


series )> a, converges absolutely, if L > 1 the series diverges, and if L = 1 this test gives 


no information. P| 


The proof of the root test is actually easier than that of the ratio test, and is a good 
exercise. 


EXAMPLE 11.7.4 Analyze )~ we, 


n 
n=0 
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The ratio test turns out to be a bit difficult on this series (try it). Using the root test: 


n\ 1/n n)\1/n 
lim (=) = lim oe. = lim 2 = 0. 


noo \ n”™ 


Since 0 < 1, the series converges. 


The root test is frequently useful when n appears as an exponent in the general term 
of the series. 


Exercises 11.7. 


1. Compute lim |an+41/an| for the series )>1/n?. 
n—-co 


2. Compute lim |an+1/an| for the series }*1/n. 


n—+>co 


3. Compute lim |a,|'/” for the series 37 1/n?. 


n—+>co 


4. Compute lim |a,|'/” for the series 37 1/n. 


n—->co 


Determine whether the series converge. 


2. @ 


9. Prove theorem 11.7.3, the root test. 


11.8 POWER SERIES 


Recall that we were able to analyze all geometric series “simultaneously” to discover that 


CO _ k 
22 ak pate 


if || < 1, and that the series diverges when |x| > 1. At the time, we thought of x as an 
unspecified constant, but we could just as well think of it as a variable, in which case the 
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series 
CO 
) ka” 
n=0 


is a function, namely, the function k/(1— <x), as long as |x| < 1. While k/(1 — x) is a rea- 
sonably easy function to deal with, the more complicated 5° kx” does have its attractions: 
it appears to be an infinite version of one of the simplest function types—a polynomial. 
This leads naturally to the questions: Do other functions have representations as series? 
Is there an advantage to viewing them in this way? 

The geometric series has a special feature that makes it unlike a typical polynomial— 
the coefficients of the powers of x are the same, namely k. We will need to allow more 
general coefficients if we are to get anything other than the geometric series. 


DEFINITION 11.8.1 A power series has the form 


CO 
y Gan, 
n=0 


with the understanding that a, may depend on n but not on x. 


CO gn 
EXAMPLE 11.8.2 y — is a power series. We can investigate convergence using the 
n 
n=1 
ratio test: 
; Fa ea n 


Tw 7 _dim |e = |z| 


es n+1 Lae| n—0o n+1 


Thus when |z| < 1 the series converges and when |x| > 1 it diverges, leaving only two values 

in doubt. When x = 1 the series is the harmonic series and diverges; when x = —1 it is the 

alternating harmonic series (actually the negative of the usual alternating harmonic series) 
CO gn 

and converges. Thus, we may think of as — as a function from the interval |—1,1) to 


n=1 


the real numbers. 


A bit of thought reveals that the ratio test applied to a power series will always have 
the same nice form. In general, we will compute 


n+1 
am (entillel plental — iot tim Mental — py, 


noo |an||a]” noo lan| noo |an| 


assuming that lim |a,+1|/|@,| exists. Then the series converges if L|z| < 1, that is, if 
|x| < 1/L, and diverges if |x| > 1/L. Only the two values x = +1/L require further 
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investigation. Thus the series will definitely define a function on the interval (—1/L,1/L), 
and perhaps will extend to one or both endpoints as well. Two special cases deserve 
mention: if L = 0 the limit is 0 no matter what value x takes, so the series converges for 
all x and the function is defined for all real numbers. If LZ = oo, then no matter what 
value x takes the limit is infinite and the series converges only when x = 0. The value 1/L 
is called the radius of convergence of the series, and the interval on which the series 
converges is the interval of convergence. 


Consider again the geometric series, 


CO e 1 
237 =e 


Whatever benefits there might be in using the series form of this function are only avail- 
able to us when x is between —1 and 1. Frequently we can address this shortcoming by 
modifying the power series slightly. Consider this series: 


(a +2)” wafe+2\" 1 — 8 
a 3” ->( 3 ) [See ae 


n=0 n=0 3 


because this is just a geometric series with x replaced by (a+ 2)/3. Multiplying both sides 
by 1/3 gives 


er oe! 
grt ee 


n=0 


the same function as before. For what values of x does this series converge? Since it is a 


geometric series, we know that it converges when 


ja + 2|/3 <1 
jn +2| <3 
—-3<2@4+2<3 
=o <7 <i. 


So we have a series representation for 1/(1—) that works on a larger interval than before, 
at the expense of a somewhat more complicated series. The endpoints of the interval of 
convergence now are —5 and 1, but note that they can be more compactly described as 
—2+3. We say that 3 is the radius of convergence, and we now say that the series is 
centered at —2. 
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DEFINITION 11.8.3 A power series centered at a has the form 


CO 
_ An(z — a)”, 
n=0 


with the understanding that a, may depend on n but not on 2. 


Exercises 11.8. 


Find the radius and interval of convergence for each series. In exercises 3 and 4, do not attempt 
to determine whether the endpoints are in the interval of convergence. 


n 


1. See 2. yee 


n=0 n=0 
“ni “nl 
3 oo at 4. 5 (@-2)" > 
mn nn 
n=1 n=1 
 (n!)? : (a +5)” 


11.9 CALCULUS WITH POWER SERIES 


Now we know that some functions can be expressed as power series, which look like infinite 
polynomials. Since calculus, that is, computation of derivatives and antiderivatives, is easy 
for polynomials, the obvious question is whether the same is true for infinite series. The 


answer is yes: 


THEOREM 11.9.1 Suppose the power series f(x) = > Gala — a)” has radius of 
n=0 


convergence R. Then 
f(x) = nan (a — a)", 
n=0 


lee) a ' 
[fe)dr=c+¥ Sea) aa 


and these two series have radius of convergence R as well. = 
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EXAMPLE 11.9.2 Starting with the geometric series: 


1 — 
ie 


CO 


1 1 
i dz = —In|1-—2|= a 
La “wet 1 
inieul= = : got 
9 ee 1 
when |x| < 1. The series does not converge when x = 1 but does converge when x = —1 


or 1—a = 2. The interval of convergence is [—1,1), or 0 < 1—2 < 2, so we can use the 
series to represent In(z) when 0 < x < 2. For example 


CO 


In(3/2) = In(t = -1/2) = 0(-1)"" 


n= 


and so 


In(3/2) 1 1 + 1 1 5, 1 1 4 1 909 
n i — — = 
2 8 24 64 160 384 896 2240 


Because this is an alternating series with decreasing terms, we know that the true value 
is between 909/2240 and 909/2240 — 1/2048 = 29053/71680 = .4053, so correct to two 
decimal places the value is 0.41. 

What about In(9/4)? Since 9/4 is larger than 2 we cannot use the series directly, but 


~ 0.406. 


In(9/4) = In((3/2)) = 21n(3/2) = 0.82, 


so in fact we get a lot more from this one calculation than first meets the eye. To estimate 
the true value accurately we actually need to be a bit more careful. When we multiply by 
two we know that the true value is between 0.8106 and 0.812, so rounded to two decimal 


places the true value is 0.81. 


Exercises 11.9. 


Find a series representation for n2. > 
Find a power series representation for 1/(1—«)*. > 
Find a power series representation for 2/(1 — x)°. > 


Find a power series representation for 1/(1 — x)*. What is the radius of convergence? > 


a ae ee 


Find a power series representation for / In(l — x)dz. => 
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11.10 TAYLOR SERIES 


We have seen that some functions can be represented as series, which may give valuable 
information about the function. So far, we have seen only those examples that result from 
manipulation of our one fundamental example, the geometric series. We would like to start 


with a given function and produce a series to represent it, if possible. 


Suppose that f(x) = Ss" a,x” on some interval of convergence. Then we know that 
n=0 
we can compute derivatives of f by taking derivatives of the terms of the series. Let’s look 
at the first few in general: 


fia) = ys na,c”—) = a, + 2agx + 3agx7 + 4agx? + --- 
R= 1. 


(oe) 


fa) = » n(n — lan"? = 2a2 +3-2a3x-+4-3asz7 +--- 
n=2 

a S- n(n —1)(n — 2)anx" 3 = 3-203 +4-3-2agr4--- 
n=3 


By examining these it’s not hard to discern the general pattern. The kth derivative must 
be 


CO 


f(x) = S*n(n-1)(n-2)-++(n—k+ Lane” 


n=k 
= k(k —1)(k —2)---(2)(1)ay + (K+ 1)(k) +++ (2)angie + 
+ (hE 2) (kb 1) +22 (B)agsae? +e: 


We can shrink this quite a bit by using factorial notation: 


= k +2)! 
f(z) = S- aap” =klay + (k+ 1)laygix + ON aso” fe 
n=k ‘ , 
Now substitute x = 0: 
sts | 
f (0) = Blan + S> —an0"* = hla 
n=k+1 ( a )! 
and solve for az: 
— f® 0) 
eer 7 


Note the special case, obtained from the series for f itself, that gives f(0) = ao. 
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So if a function f can be represented by a series, we know just what series it is. Given 
a function f, the series 


of 0 : 
FLO, 


a n! 
is called the Maclaurin series for f. 


EXAMPLE 11.10.1 Find the Maclaurin series for f(x) = 1/(1— 2). We need to 
compute the derivatives of f (and hope to spot a pattern). 


f(z) =(1-a)7? 

f'(@)=(l-a)* 
f"(a) =201-a)° 
f(a) = 6(1—a)~* 
f(x) =4'0 — 2) 


So 


and the Maclaurin series is 


CO CO 
y a= y me 
n=0 n=0 


the geometric series. 


A warning is in order here. Given a function f we may be able to compute the 
Maclaurin series, but that does not mean we have found a series representation for f. We 
still need to know where the series converges, and if, where it converges, it converges to 
f(a). While for most commonly encountered functions the Maclaurin series does indeed 
converge to f on some interval, this is not true of all functions, so care is required. 

As a practical matter, if we are interested in using a series to approximate a function, 
we will need some finite number of terms of the series. Even for functions with messy 
derivatives we can compute these using computer software like Sage. If we want to know the 
whole series, that is, a typical term in the series, we need a function whose derivatives fall 
into a pattern that we can discern. A few of the most important functions are fortunately 


very easy. 
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EXAMPLE 11.10.2 Find the Maclaurin series for sin x. 

The derivatives are quite easy: f’(x) = cosa, f”(x) = —sinaz, f’(x) = —cosa, 
Fie (x) = sina, and then the pattern repeats. We want to know the derivatives at zero: 1, 
0, —1, 0, 1, 0, —1, 0,..., and so the Maclaurin series is 


it 
ear BL Yc "OnE OL 


We should always determine the radius of convergence: 


; |jx|??*3 (2n +1)! ; |x|? 
lim = hm 
n—rco (2n+3)! |x|?"t+1 — n-v00 (2n + 3)(2n + 2) 


= 0, 


so the series converges for every x. Since it turns out that this series does indeed converge 


to sinaz everywhere, we have a series representation for sinz for every x. Here is an 


interactive plot of the sine and some of its series approximations. 


Sometimes the formula for the nth derivative of a function f is difficult to discover, 
but a combination of a known Maclaurin series and some algebraic manipulation leads 
easily to the Maclaurin series for f. 


EXAMPLE 11.10.3 Find the Maclaurin series for x sin(—2). 
To get from sinz to xsin(—x) we substitute —x for x and then multiply by x. We can 
do the same thing to the series for sin x: 


grr g2rtl co 2n+2 


“yo a oe ier (Qn +1)! =a (Qn +1)! 


As we have seen, a general power series can be centered at a point other than zero, 
and the method that produces the Maclaurin series can also produce such series. 


EXAMPLE 11.10.4_ Find a series centered at —2 for 1/(1— 2). 


If the series is D> an(a + 2)” then looking at the kth derivative: 


n=0 
k! 1 —k-1 __ — n) 9) n—k 
1(1— a) = 2 amine ) 
and substituting « = —2 we get k!3~*~! = kla, and a, = 3-*~! = 1/3**1, so the series is 
3 (2 + 2)” 
n+1 ° 
n=0 3 


We’ve already seen this, on page 282. 
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Such a series is called the Taylor series for the function, and the general term has 


the form 
f(a) 


n! 


(2 —a)”. 
A Maclaurin series is simply a Taylor series with a = 0. 


Exercises 11.10. 


For each function, find the Maclaurin series or Taylor series centered at a, and the radius of 
convergence. 


1. cosx => 
a = 
ie 65 = 
._Ingz,a=1> 


. 1/z?,a=1> 

.i/Vvl-z> 

. Find the first four terms of the Maclaurin series for tan x (up to and including the x® term). 
=> 


2 
3 
4 
5. Invz,a=2> 
6 
7 
8 


9. Use a combination of Maclaurin series and algebraic manipulation to find a series centered 
at zero for x cos(x”). > 


10. Use a combination of Maclaurin series and algebraic manipulation to find a series centered 


xz 


at zero for re ©. => 


11.11 TAYLOR’S THEOREM 


One of the most important uses of infinite series is the potential for using an initial portion 
of the series for f to approximate f. We have seen, for example, that when we add up the 
first n terms of an alternating series with decreasing terms that the difference between this 
and the true value is at most the size of the next term. A similar result is true of many 


Taylor series. 


THEOREM 11.11.1 Suppose that f is defined on some open interval J around a and 
suppose f iN +1) (zr) exists on this interval. Then for each x 4 a in I there is a value z 


between x and a so that 


NFO ag (N+1)(y 
f(a) = > ean 4 Fe — ay, 
a , 
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Proof. The proof requires some cleverness to set up, but then the details are quite 
elementary. We want to define a function F(t). Start with the equation 


N 


F(t) = S- f'® (2 —t)"+ Bie —2t)8t. 


n! 
n=0 


Here we have replaced a by ¢ in the first N + 1 terms of the Taylor series, and added a 
carefully chosen term on the end, with B to be determined. Note that we are temporarily 
keeping x fixed, so the only variable in this equation is t, and we will be interested only in 
t between a and x. Now substitute t = a: 


n=0 nl 
Set this equal to f(z): 
NF™ 
f(s) = a u a) (2 —a)" + Bia —a)**. 
n=0 : 


Since x # a, we can solve this for B, which is a “constant”—it depends on x and a but 
those are temporarily fixed. Now we have defined a function F'(t) with the property that 
F(a) = f(x). Consider also F(x): all terms with a positive power of (2 — t) become zero 
when we substitute x for t, so we are left with F(x) = f(x)/0! = f(x). So F(t) isa 
function with the same value on the endpoints of the interval [a,x]. By Rolle’s theorem 
(6.5.1), we know that there is a value z € (a,x) such that F’(z) = 0. Let’s look at F’(t). 
Each term in F'(t), except the first term and the extra term involving B, is a product, so 
to take the derivative we use the product rule on each of these terms. It will help to write 
out the first few terms of the definition: 


fw) 
3! 


IMO, 
U 
FM) 
NI 


(2) 
ama), 


5 (2 —t)?+--- 


F(t) = f(t)+ 2—t)' + a mes 


-- (c-t)% + Bie —0)*. 
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Now take the derivative: 


Fe =70+ (FPe-11-+ Fe -»') 
+ (2 au (x — t)*(—1) + al (a -1)?} 
+ (Z alu (a —t)?(—1) + ao (x -1)5} es 
«(fear iye 
+ B(N + 1)(2 —t)%(-1) 


Now most of the terms in this expression cancel out, leaving just 


(N+1) 
Big = Pe —t)N + B(N +1)(a — t)% (-1). 
At some z, F’(z) = 0 so 
(N+1) 
= u — (2) (a— z)" + BIN + 1)(2 —z)* (-1) 
(N+1) 
BIN +1)(2=—2)" = u ae (2) (x —z)% 
fONFD (2) 
~ (N+ 1)! 
Now we can write 
N 
da) gash ) 
F(t) = = =f ee (Way @ Sa 
Recalling that F(a) = f(a) we get 
Np) (N+1) 
fa) = I e-0)"+ ea ea 
which is what we wanted to show. = 


It may not be immediately obvious that this is particularly useful; let’s look at some 


examples. 
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EXAMPLE 11.11.2 Find a polynomial approximation for sinx accurate to +0.005. 
From Taylor’s theorem: 


Nf) (N+1) 
sinx = dX u “a (2—a)" + ane rae 


What can we say about the size of the term 


aes 


N+19 
(N +1)! 


(x — a) 


Every derivative of sin x is +sinx or tcosz, so |f(%*)(z)| <1. The factor (x —a)%*? is 
a bit more difficult, since x — a could be quite large. Let’s pick a = 0 and || < 7/2; if we 
can compute sin for x € [—7/2,7/2], we can of course compute sin x for all x. 


We need to pick N so that 


N+ 
0.005. 
(N+ 1)!| ~ 
Since we have limited x to |—7/2, 7/2], 
gNrl 2 9N+1 
(N +1)! (N+ 1)! 


The quantity on the right decreases with increasing N, so all we need to do is find an N 


so that 
9N+1 


A little trial and error shows that N = 8 works, and in fact. 29/9! < 0.0015, so 


Sr ACOl( 0) 
sinz = Ss" ro) x” + 0.0015 
n=0 . 


73 7? i 


xv 
™~— * 790” 5040 


+ 0.0015. 


Figure 11.11.1 shows the graphs of sinz and and the approximation on [0, 37/2]. As x 


gets larger, the approximation heads to negative infinity very quickly, since it is essentially 


acting like —a". 
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1a 


0 
-1- 


24 
= 3521 
44 


= 
Figure 11.11.1 sina and a polynomial approximation. (AP) 


We can extract a bit more information from this example. If we do not limit the value 
of x, we still have 


‘ae gel on 
(NV +1)! ~ |(N +1)! 
so that sin x is represented by 
N FO) N+ 
SEO ng | 2. 
—: (N +1)! 
If we can show that 
Nt 
=() 
im ay 
for each x then 
Be ey et Oe Sy a 
ae nl eS) (Qn +1)!’ 


that is, the sine function is actually equal to its Maclaurin series for all x. How can we 
prove that the limit is zero? Suppose that N is larger than |x|, and let M be the largest 
integer less than |x| (if MM = 0 the following is even easier). Then 


oN tel tel tel del delet al a 
(N+1)! N+1NN-1 M+1MM-1 21 
N+1 MM-1 21 
_ _iel_|zl¥ 
— N+1 M!" 


The quantity |2|“/M! is a constant, so 


lel leh 
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and by the Squeeze Theorem (11.1.3) 


Nt 
lim |———_|=0 
as desired. Essentially the same argument works for cos x and e*; unfortunately, it is more 
difficult to show that most functions are equal to their Maclaurin series. 


EXAMPLE 11.11.3 Find a polynomial approximation for e” near x = 2 accurate to 
+0.005. 
From Taylor’s theorem: 


a e? e* 
ae — 2)" _9 N+1 
ae ss vive | 


since f(") (x) = e® for all n. We are interested in x near 2, and we need to keep |(a—2)%*1| 


in check, so we may as well specify that |~ — 2| < 1, so x € [1,3]. Also 


e? 


= (N40! 


e* 


oe 


so we need to find an N that makes e?/(N + 1)! < 0.005. This time N = 5 makes 
e?/(N +1)! < 0.0015, so the approximating polynomial is 


e2 2 2 


e2 
ec =e + e2%(x — 2) + 5 (e- 2)? + S(@-2)° + Se - 24 + ~_(g — 2)5 + 0.0015. 


120 


This presents an additional problem for approximation, since we also need to approximate 


e”, and any approximation we use will increase the error, but we will not pursue this 


complication. 


Note well that in these examples we found polynomials of a certain accuracy only on 
a small interval, even though the series for sina and e” converge for all 2; this is typical. 
To get the same accuracy on a larger interval would require more terms. 


Exercises 11.11. 


1. Find a polynomial approximation for cosx on [0,7], accurate to +107° 

2. How many terms of the series for In x centered at 1 are required so that the guaranteed error 
on [1/2,3/2] is at most 107°? What if the interval is instead [1,3/2]? > 

3. Find the first three nonzero terms in the Taylor series for tan x on [—7/4, 7/4], and compute 


the guaranteed error term as given by Taylor’s theorem. (You may want to use Sage or a 
similar aid.) > 
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4. Show that cosa is equal to its Taylor series for all « by showing that the limit of the error 
term is zero as N approaches infinity. 


5. Show that e* is equal to its Taylor series for all x by showing that the limit of the error term 
is zero as N approaches infinity. 


11.12 ADDITIONAL EXERCISES 


These problems require the techniques of this chapter, and are in no particular order. Some 
problems may be done in more than one way. 


Determine whether the series converges. 


er 
5 a ee 
i ae Oe ae a 
= n 
So ae 
o> WA) 
=< wl 
4. ga? 
n=0 
3 5 7 9 
5. 1 } => 
4 8 12 16 
_ 1 
6. S > => 
a n2+A4 
oO 3 
sin” (7) 
7 n2 = 
n=0 
= nm 
8. a> 
n=0 
9 > => 
44+1-3-5---(Qn—1) 
10 de 
nag es 


11. 


a ee eet = 


— (2n)! 
13. » a => 

n=0 : 

i) _4)\n-1 
ig yo 


15 


16. 


17. 


18. 


19. 


20. 


21. 


22. 


23. 


24. 


25. 
26. 


27. 


28. 


29. 


30. 
31. 
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oo grgn-l 
Dees eae 

n=1 . 
52 54 58 58 
et@apt@aoe'@d6eap 


Y= sin(1/n) => 


Find the interval and radius of convergence; you need not check the endpoints of the intervals. 


n 


ct => 


n 
nt 


lage: 
e é| 8 
1 


8 
+ 
8 
w 
—_ 
w 
8 
oO 
+ 
— 
w 
or 
8 
“I 


ie. 
w 
bo 
sw 
Ol 
bo 
aw 
[o>) 
~J 


lag: 
3, 

oy 

I) 


Pale 
eS 
3| | 
= 
S 
J) 


Find a series for each function, using the formula for Maclaurin series and algebraic manip- 
ulation as appropriate. 


ee 
arctan(x) => 


Use the answer to the previous problem to discover a series for a well-known mathematical 
constant. => 


A 


Selected Answers 


1.1.1. (2/3)x + (1/3) 
1.1.2. y= —2e 
1.1.3. (—2/3)a + (1/3) 
1.1.4. y=224+2,2,-1 
1.1.5. y=—2 +6, 6, 6 
1.1.6. y=2/2+1/2, 1/2, -1 
1.1.7. y = 3/2, y-intercept: 3/2, no 
x-intercept 
1.1.8. y = (—2/3)x — 2, —2, -3 
1.1.9. yes 
L110. gS Oy = S26 2g. = 22 
1.1.11. y = 75t (t in hours); 164 minutes 
1.1.12. y = (9/5)x + 32, (—40, —40) 
1.1.13. y= 0.152 +10 
1.1.14. 0.03” + 1.2 
115. (a) ae = 
0 0<a< 100 
(x/10)—10 100<«< 1000 
x — 910 1000 < x 


1.1.16. 


i Rs Ey ge 


1.1.18. 
Lek, 


1.2.2. 


1.2.6. 
1.3.1. 
1.3.2. 


Yy — 
0.15x 0<a < 19450 
0.28% — 2528.50 19450 < x < 47050 
0.332 — 4881 47050 < x < 97620 


(a) P = —0.00012 + 2 

(b) « = —10000P + 20000 
(2/25)x — (16/5) 

(a) 27 +y2=9 

(b) (x — 5)? + (y—-6)? =9 
(c) eee) + (y+6)? =9 
ae ae 3)? = 

(e) x 
(f) (a - 
(a) 
y= 
(b) 
y = —3x 4+ 2, /10 

(c) Ax = —2, Ay = -2, m = 1, 
y=2, V8 

(x + 2/7)? + (y — 41/7)? = 1300/49 
{x | x 2 3/2} 

{x | 2 #—1} 


TAs ae A eS ee 
(3/2)x — 3, V13 
A 


eS =), Ayia 3,10 = 3, 
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1.3.3. 
1.3.4. 
1.3.5. 
1.3.6. 
1.3.7. 
1.3.8. 
1.3.9. 
1.3.10. 
1.3.11. 
1.3.12. 
1.3.13. 
1.3.14. 
1.3.15. 


1.3.16. 
2.1.1. 


2.1.2. 
2.1.3. 


2.1.4. 


2.1.5. 


2.1.6. 
2.2.1. 
Dedede 
2.2.3. 


2.3.1. 
2.3.2. 
2.3.3. 


ee 
{x 


oe Vand ¢ 41} 
0} 
xé€R},ie., all x 
x > 0} 
h-r<a<h-+r} 
£> 1 org <0} 
—1/3 < « < 1/3} 
x>Oandz 41} 
x>Oandz 1} 


qe 8 has) Gee. OT 

A = «(500 — 2x), {x |0 < x < 250} 
V =7(50—-—ar?), {fr |O<r< 
50/7} 

A = 2nr? +2000/r, {r |0<r < co} 
—5, —2.47106145, —2.4067927, 
—2.400676, —2.4 

—4/3, 24/7, 7/24, 3/4 
—0.107526881, —0.11074197, 


—0.1110741, seen es 
3(3 + Az) 9 
34+ 3Axr + Ax? 
14+ Az 
3.31, 3.003001, 3.0000, 
3+ 3Az + Ax? > 3 


+3 


mm 
10, 25/2, 20, 15, 25, 35. 
5, 4.1, 4.01, 4.001,4+ At 4 


—10.29, —9.849, —9.8049, 
—9.8 —4.9At > —9.8 


7 
5 
0 


2.3.4. 
2.3.5. 
2.3.6. 
2.3.7. 
2.3.8. 
2.3.9. 
2.3.10. 
2.3.11. 
2.3.12. 
2.3.13. 
2.3.14. 
2.3.15. 
2.3.16. 
2.3.18. 


2.4.1. 
2.4.2. 
2.4.3. 
2.4.4. 
2.4.5. 
2.4.8. 
2.4.9. 
2.4.10. 
2.4.11. 
2.5.6. 
3.1.1. 
3.1.2. 
3.1.3. 
3.1.4. 
3.1.5. 
3.1.6. 


undefined 


does not exist 


v2 


3a? 


—2/(2¢ + 1)9/? 
5/(t +2)? 

y= —1382+17 
—8 

-0.5 or 1.3 or 3.2 
1002? 
—100¢~ 1! 


(3/4)e-¥/4 
=(9/7)07 161" 


3.2.1. 
3.2.2. 
3.2.3. 
3.2.4. 
3.2.5. 
3.2.6. 
3.2.7. 
3.2.8. 
3.2.9. 


3.2.11. 


3.2.12. 
3.3.1. 
3.3.2. 


3.3.3. 


3.3.4. 


3.3.5. 


3.4.1. 


3.4.2. 


3.4.3. 


3.4.4. 


3.4.5. 
3.4.6. 
3.4.8. 
3.4.9. 


15x? + 24 

—202* + 6x + 10/2? 
—30x + 25 

6x7 + 2x — 8 

327 + 60-1 

Qa? — 2/625 — 22 
y = 1382/44 5 

y = 24a — 48 — 7? 
—49t/5+5, —49/5 


n 
y ka,a®—! 
k=1 


x /16 — 32/444 
Se (a= be 10). 42° Bz? = 5) 


(x? + 5a —3)(5x* — 18x +62 —7)+ 


(2x + 5)(a° — 6x? + 3x? — 7x + 1) 


(soe ez 
2/a 625 — a? 

=] 20/625 — x2 
019/625 — a2 at 
f =A Qe —'3),y=4e—7 

3a? x? (3x7 — 5) 
w—5¢+10 («3 — 5a + 10)? 

2x£+5 


2 — 6x23 +322 —7r+1 — 


(x? + 5a — 3)(5x4 — 182? + 6x — 7) 


(2° — 623 + 3x2 — 7x + 1)? 


1 w3/2 
a 
2./2,/625 — 22 (625 — x?) 3/2 
=i) 20./625 — x2 
9/625 — x2 et 


y = 172/4— 41/4 

y = 11x/16 — 15/16 
y = 19/169 — 52/338 
13/18 


3.5.1. 
3.5.2. 
3.5.3. 


3.5.4. 
3.5.5. 


3.5.6. 
3.5.7. 


3.5.8. 


3.5.9. 


3.5.10. 


3.5.11. 


3.5.12. 


3.5.13. 


3.5.14. 


3.5.15. 


3.5.16. 


3.5.17. 
3.5.18. 
3.5.19. 
3.5.20. 
3.5.21. 
3.5.22. 
3.5.23. 
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Ag? — Qn? +447 
327 — da + 2//z 
6(a? + 1)?x 


/169 — 2? — 27/1/1169 — x? 


(Qa — 4)\/25 — 2?2— 
(x? — 4 + 5)x/+/25 — x? 


~<a fr? — 22 
223 //1+a4 
i) 


4,/x(5 — /x)3/2 
6+ 18x 
2g 


ee +e41 


1-2 


(1-2)? 


~1//25 — 2? — 25 — 22/2? 


1 / —169 1 / 169 
2 x? V a = 


3a? — 2¢ + 1/2? 


2\/x3 — x? — (1/2) 


300x 
(100 — #2)572 

1+ 32? 
3(a + «3)2/3 


(oc? +1)+ 5 


ay (0? $l)A ba T+ (eg? + 1)? 
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3.5.24. 
3.5.25. 
3.5.26. 
3.5.27. 
3.5.28. 
3.5.29. 
3.5.30. 
3.5.31. 
3.5.32. 
3.5.33. 
3.5.34. 
3.5.35. 


3.5.36. 
3.5.37. 
3.5.38. 
3.5.39. 


3.5.40. y= 


4.1.1. 
4.1.2. 
4.1.3. 
4.1.4. 
4.1.11. 
4.3.1. 
4.3.2. 
4.3.3. 
4.3.4. 
4.3.5. 
4.3.6. 
4.3.7. 
4.4.1. 


1/(z +1) 

3(8a — 2)/(4a? — 22 +1)? 

—327 +5¢—1 

62(2a — 4)? + 6(3a? + 1)(22 — 4)? 
—2/(a — 1)? 

Ag /(x? +1)? 

(x? — 6x + 7)/(x — 3)? 
—5/(82—4)? 

60a* + 72x° + 182? + 182 — 6 

(5 — 4x)/((2a + 1)*(a — 3)”) 
1/(2(2 + 3z)?) 

56a° + 72x° + 1102* + 100x? + 
60a? + 28x + 6 

y = 232/96 — 29/96 

y =3- 22/3 

y = 1382/2 — 23/2 

y=27 11 

20 + 2/5 3V5 
“Sieve Sy EaVs 
2nn — 1/2, any integer n 


nx + 7/6, any integer n 
(V2 + v6)/4 
—~(1+ V3)/(1-— V3) =2+ V3 


L772 


sin(y/x) cos( Vx) / Vx 


4.4.2. 


4.4.3. 


4.4.4. 


4.4.5. 


4.5.1. 
4.5.2. 


4.5.3. 


4.5.4. 


4.5.5. 
4.5.6. 
4.5.7. 
4.5.8. 
4.5.9. 


4.5.10. 


4.5.11. 
4.5.12. 


4.5.13. 
4.5.14. 
4.5.15. 
4.5.16. 
4.5.17. 


4.5.18. 


4.7.1. 


4.7.2. 


sin x 


2/x 
COs & 
sin? x 

(2x +1)sinx — (x? +2) cosz 


sin? x 


+ /x cos x 


—sinxcosx 


V1- sin? x 


cos? x — sin? x 


— sin x cos(cos x) 


tanxz + xsec? x 


2/zxtanz 
sec? x(1 +sinx) — tanxcosz 
(1+ sina)? 
— csc? x 
—cscx cot x 


3x” sin(23x7) + 46a* cos(232”) 
0 
—6 cos(cos(6x)) sin(6z) 


sec 6 tan 6 _ sin 0 
(1+sec@)2 (cos@+1)? 


5t* cos(6t) — 6t? sin(6t) 


3t?(sin(3t) + tcos(3t))/ cos(2t) + 
2t? sin(3t) sin(2t) / cos? (2t) 


nr /2, any integer n 

7/2+ nm, any integer n 

y = V3x/2+3/4— V3r/6 
y = 8V3a +4 — 8V31/3 

y = 3V3a/2 — V30/4 


1/6 + 2n7, 57/6 + 2n7, any integer 


n 

2In(3)a3”" 

cos x — sinx 
ex 


4.7.3. 
4.7.4. 
4.7.5. 


4.7.6. 


4.7.7. 
4.7.8. 
4.7.9. 
4.7.10. 
4.7.11. 
4.7.12. 
4.7.13. 
4.7.14. 
4.7.15. 
4.7.20. 
4.8.1. 
4.8.2. 
4.8.3. 
4.8.4. 
4.8.5. 
4.8.6. 
4.8.7. 
4.8.8. 
4.8.9. 
4.8.12. 
4.8.13. 
4.8.14. 


4.8.15. 
4.8.16. 


4.8.17. 


cos(x)e""” 
; sin x 
Sie (cosina + ) 
x 


3x7e* + xe” 

1+ 2” In(2) 

—2z In(3)(1/3)” 

e** (49 —1)/x? 

(3x7 + 3)/(z® +32) 

—tan(z) 

(1 = In(2?)) /(x?Vin(@)) 
sec(Zx) 

£8) (cos(x) /x — sin(x) In(a)) 


e 


x/y 
—(2x% + y)/(x + 2y) 


(Qay — 3a* — y*)/(2xy — 3y? — x”) 


sin(x) sin(y)/(cos() cos(y)) 


—Vy/Va 


(y sec? (a/y) —y?)/(asec?(a/y)+y7) 
(y — cos(x + y))/(cos(x + y) — 2) 


~y? fa? 
1 

y= JE 6 

C= Ves 

(V3,2V3), (—v3, -2V3), 
(2V3, V3), (-2V3, -v3) 
y = Tr/V3 —8/V3 


y= (yp Paty rar t+ay ys) /2 
(y =, y1) = (2a? ae 271y2 = X1) 

(2y? + 2yix? + y1) 
2X1) 


(x — 


. —32? cos(x?) / 
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1 — sin?(x3) 


” (arcsinx) V1 — x? 


—e*/\/1 — 2% 


Ma 


(1+Inz)z* 


" In 5(1 + 227) arctan(z*) 
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4.10.20. co 5.1.2. min at y = —1, max atr=1 
4.10.21. co Dilad.. Thao oe = 2, min, ate = 4 
4.10.22. 2/7 5.1.4. min at « = +1, max at x = 0. 
4.10.23. 2 5.1.5. min at r= 1 

4.10.24. —oo 5.1.6. none 

4.10.25. 1 5.1.7. none 

4.10.26. 1 5.1.8. min at = 77/12 4+ ka, max at 
4.10.27. 2 x = —1/12+kn, for integer k. 
4.10.28. 1 5.1.9. none 

4.10.29. 0 5.1.10. local max at x =5 

4.10.30. 1/2 5.1.11. local min at 2 = 49 

4.10.31. 2 5.1.12. local min at x = 0 

4.10.32. 0 5.1.15. one 

4.10.33. 00 5.2.1. min at x =1/2 

4.10.34. 1/2 5.2.2. min at « = —1, max at r= 1 
4.10.35. 0 5.2.3. max at c= 2, min at x = 4 
4.10.36. 1/2 5.2.4. min at x = +1, max at x = 0. 
4.10.37. 5 5.2.5. minat r= 1 

4.10.38. 2/2 5.2.6. none 

4.10.39. —1/2 5.2.7. none 

4.10.40. 2 5.2.8. min at « = 77/12 4+ ka, max at 
4.10.41. 0 x = —1/12+kn, for integer k. 
4.10.42. co 5.2.9. none 

4.10.43. 0 5.2.10. max at r= 0, min at ry = +11 
4.10.44. 3/2 5.2.11. min at « = —3/2, neither at x = 0 
4.10.45. oo 5.2.13. min at nz, max at 7/2+n7 
4.10.46. 5 5.2.14. min at 2na, max at (2n+1)7 
4.10.47. —1/2 5.2.15. min at 7/2+2n7, max at 37 /2+2n0 
4.10.48. does not exist 5.3.1. min at cx =1/2 

4.10.49. co 5.3.2. min at y= —1, max atr=1 
4.10.50. y=landy=-—1 5i.di. Max at c= 2, minats=—4 


Sle. min athe 1/2 5.3.4. min ata = +1, max at x= 0. 


5.3.5. 
5.3.6. 
5.3.7. 
5.3.8. 


5.3.9. 
5.3.10. 
5.3.11. 
5.3.12. 
5.3.13. 
5.3.14. 
5.3.15. 
5.3.16. 
5.3.17. 
5.3.18. 

5.4.1. 

5.4.2. 


5.4.3. 


5.4.4. 


5.4.5. 


5.4.6. 


5.4.7. 


5.4.8. 


5.4.9. 


minatzx=1 
none 
none 


min at x = 77/12 + nz, max at 
x = —1/12+ nz, for integer n. 
max at x = 63/64 

max at x = 7 

max at ma min at 5~!/4 
none 

max at —1, min at 1 

min at 2718 

none 

min at n7 

max at n7, min at 7/2+n7 

max at 7/2+2n7, min at 37 /2+2n7 


concave up everywhere 

concave up when x < 0, concave 
down when zx > 0 

concave down when x < 3, concave 
up when x > 3 

concave up when « < —1/V3 or 
x > 1/V3, concave down when 
-1/V¥3<«¢<1/v3 

concave up when x < 0 or x > 2/3, 
concave down when 0 < x < 2/3 
concave up when xz < 0, concave 
down when x > 0 

concave up when x < —1 or g > 1, 
concave down when —1 < x < 0 or 
Oe ek 

concave down on ((8n—1)7/4, (8n+ 
3)7/4), concave up on ((8n + 
3)m/4, (8n + 7)7/4), for integer n 


concave down everywhere 


5.4.10. 


5.4.11. 
5.4.12. 


5.4.13. 
5.4.14. 
5.4.15. 
5.4.16. 
5.4.17. 
5.4.18. 


5.4.19. 


6.1.1. 
6.1.2. 
6.1.3. 
6.1.4. 


6.1.5. 
6.1.6. 


6.1.7. 

6.1.8. 

6.1.9. 
6.1.10. 
6.1.11. 
6.1.12. 
6.1.13. 
6.1.14. 
6.1.15. 
6.1.16. 
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concave up on (—oo, (21 — V497) /4) 
and (21 + V497)/4, 00) 

concave up on (0, co) 

concave down on (2n7/3, (2n + 
1)7/3) 

concave up on (0, co) 

concave up on (—oo, —1) and (0, co) 
concave down everywhere 

concave up everywhere 

concave up on (7/4+n7,32/4+n7) 
inflection points at n7, 

+ arcsin(./2/3) + na 

up/incr: (3,00), up/decr: (—oo, 0), 
(2,3), down/decr: (0,2) 

max at (2,5), min at (0,1) 

25 xX 25 

P/4x P/4 

wH=l=2-5°7 1p=57, p/w = 
1/2 

v100 x ¥/100 x 2/100, h/s =2 
w =1=238y)/3, p= V3 /92/3 
hiweH 172 

1250 square feet 

1?/8 square feet 


$5000 

100 

yr 

hjr=2 

bir 9 

r= om, h= 40/7 cm, iff = 8/4 


8/1 
4/27 
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6.1.17. 
6.1.18. 


6.1.19. 


6.1.20. 


6.1.21. 
6.1.22. 


6.1.23. 
6.1.24. 
6.1.25. 
6.1.26. 
6.1.27. 


6.1.28. 


6.1.29. 
6.1.30. 


6.1.31. 


6.1.32. 
6.1.33. 
6.1.34. 


Go direct from A to D. 
(a) 2, (b) 7/2 
ec ee ee 
x a a 
6 6 2 =A 12 


(a) a/6, (b) (a +b - 

Va? — ab+ b?)/6 

1.5 meters wide by 1.25 meters tall 
If k < 2/m the ratio is (2 — km)/4; 
if k > 2/7, the ratio is zero: the 


window should be semicircular with 
no rectangular part. 


a/b 

w = 2r/V3, h = 2V2r/V3 
1/V3 © 58% 

18 x 18 x 36 

r =5/(2n)'/? = 2.7 cm, 

h = 5-2/9 fat/8 = dp & 10.8 om 


750 ( 2n2\'/3 
fy: = i 
nm \ 750? 
7502 \ 1/6 
e 
h/r = /2 


The ratio of the volume of the 


sphere to the volume of the cone is 
1033/4096 + 33/4096V17 = 0.2854, 
so the cone occupies approximately 
28.54% of the sphere. 

P should be at distance cW/a/(W/a+ 
Vb) from charge A. 

1/2 

$7000 

There is a critical point when 
sin6,/v, = sin@2/v2, and the 


second derivative is positive, so 


6.2.1. 
6.2.2. 
6.2.3. 
6.2.4. 
6.2.5. 
6.2.6. 
6.2.7. 
6.2.8. 
6.2.9. 
6.2.10. 
6.2.11. 
6.2.12. 
6.2.13. 
6.2.14. 
6.2.15. 
6.2.16. 
6.2.17. 
6.2.18. 
6.2.19. 
6.2.20. 
6.2.21. 
6.2.22. 
6.2.23. 
6.2.24. 
6.2.25. 


6.3.1. 
6.3.2. 
6.3.3. 
6.3.4. 


there is a minimum at the critical 


point. 

1/(167) cm/s 

3/(10007) meters/second 

1/4 m/s 

6/25 m/s 

807 mi/min 

3V5 ft/s 

20/(37) cm/s 

13/20 ft/s 

5V10/2 m/s 

75/64 m/min 

1457/72 m/s 

257/144 m/min 

mvV/2/36 ft/s 

tip: 6 ft/s, length: 5/2 ft/s 

tip: 20/11 m/s, length: 9/11 m/s 
380/V3 — 150 = 69.4 mph 
500/V3 — 200 = 88.7 km/hr 

18 m/s 

136V/475/19 = 156 km/hr 

—50 m/s 

68 m/s 

3800/V329 ~ 210 km/hr 
820/329 + 150V57/V/47 = 210 km/hr 
4000/49 m/s 

(a) x = acos@ —asin@ cot(@+ B) = 
asin 3/sin(@+ 8), (c) & 3.79 cm/s 
v3 = 1.475773162 

2.15 

3.36 

2.19 or 1.26 


6.4.1. 
6.4.2. 
6.4.3. 
6.4.4. 
6.5.1. 
6.5.2. 
6.5.6. 
6.5.7. 
6.5.8. 
6.5.9. 
Glee 
Tl2. 
7.1.3. 
7.1.4. 
TileDe 
7.1.6. 
7.1.7. 


7.1.8. 
7.2.1. 
7.2.2. 
7.2.3. 
7.2.4. 
7.2.5. 
7.2.6. 
sh Ey a 
7.2.8. 
1.2.9. 
7.2.10. 


7.2.11. 
2-12; 


Ay = 65/16, dy = 2 


Ay = \/11/10 — 1, dy = 0.05 
Ay = sin(7/50), dy = 7/50 


dV = 81/25 
el 
c= V18—2 


g°/3+47x?/2—5a+k 
arctanz +k 
z*/4—Ina+k 
—cos(2x)/2+k 

10 

35/3 


a 

277 —8 

2b? — 2a? 

4 rectangles: 41/4 = 10.25, 

8 rectangles: 183/16 = 11.4375 


23/4 
(16/3)27/? + C 
P+t+C 
8/2+C 
—2/z+C 
7Tins+C 

(5a +1)?/154+C 
(x-—6)?/3+C 
2x°/2 15 4+ C 
—4//z+C 


Ap GA Oe Eee =a ee, 


t>2 
87/2 
2 


7.2.13. 
7.2.14. 
7.2.15. 
7.2.16. 
7.2.17. 
7.2.18. 
7.2.19. 
7.2.20. 
7.2.21. 
7.2.22. 

7.3.1. 


7.3.2. 


7.3.3. 
7.3.4. 
7.3.5. 
7.3.6. 
8.1.1. 
8.1.2. 
8.1.3. 
8.1.4. 
8.1.5. 
8.1.6. 


8.1.7. 
8.1.8. 
8.1.9. 
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In(10) 
eal 

3° /4 

2°/6 — 1/6 
x? — 3a 


2a(x* — 3x7) 


tan(x”) 


2x tan(x*) 
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It rises until t = 100/49, then falls. 
The position of the object at time 
tis s(t) = —4.9t7 + 20t+k. The 

net distance traveled is —45/2, that 


is, it ends up 45/2 meters below 


where it started. The total distance 


traveled is 6205/98 meters. 


27 
i sint dt = 0 
0 


net: 27, total: 20/3 +4V3 
8 

17/3 

A= 18, B= 44/3, C=10/3 
=(1=7)'°/10+C 

x /5+22°/34+2+C 

(a? + 1)1° /202 + C 

=3( = 5t)° 100 

(sin* 2)/4+C 
—(100 — «”)?/7/34+C 
S/T = FEC 

sin(sin rt)/m +C 
1/(2cos? x) = (1/2) sec? 2 +C 
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8.1.10. 
8.1.11. 
8.1.12. 
8.1.13. 
8.1.14. 
8.1.15. 
8.1.16. 
8.1.17. 
8.1.18. 
8.1.19. 
8.1.20. 
8.2.1. 
8.2.2. 
8.2.3. 
8.2.4. 
8.2.5. 
8.2.6. 
8.2.7. 
8.2.8. 
8.2.9. 
8.2.10. 
8.3.1. 
8.3.2. 


8.3.3. 
8.3.4. 


8.3.5. 
8.3.6. 


8.3.7. 
8.3.8. 


—In|cosz|+C 
0 

tan?(x)/2+C 
1/4 

—cos(tanxz) + C 
1/10 

V3/4 

OTS) GAS TPP EO 
—(3" +1)/14 

0 

f(x)?/2 


x£/2—sin(2x)/4+C 
—cos x + (cos® z)/3 + C 
3x/8 — (sin 2x) /4 + (sin 4a) /32 + C 
(cos’ x)/5 — (cos? x) /3+C 
sin x — (sin? x)/3 + C 

x/8 — (sin 4xr)/32+C 
(sin® x) /3 — (sin? z)/5+C 
—2(cos x)°/2/5 + C 

tanz —cotr+C 
(sec? x) /3 — seca + C 
—In|cscx+cotae|+C 


—csex cot #/2 — (1/2) In| csex + 
cot z|+C 


ex? —1/2-In|at+ Vx? — 1|/2+C 
v9 + 4x? /2 a 

(9/4) In |2a + /9 + 4x?|+C 
Ola ae ee 

arcsin(x)/8 — sin(4arcsin x)/32 + C 
Inja + V1+22|+C 

(x + 1)Vx? +22/2 - 

Ina +14 Vx? + 2x|/2+C 


8.3.9. 
8.3.10. 
8.3.11. 
8.3.12. 

8.4.1. 

8.4.2. 

8.4.3. 

8.4.4. 

8.4.5. 


8.4.6. 
8.4.7. 
8.4.8. 


8.4.9. 


8.4.10. 


8.4.11. 


8.4.12. 
8.4.13. 
8.4.14. 
8.5.1. 
8.5.2. 


8.5.3. 
8.5.4. 
8.5.5. 
8.5.6. 
8.5.7. 
8.5.8. 


—arctanz —1/r+C 
2arcsin(x/2) — 2/4 — 22/2+C 
arcsin(/z) — /eV/1—24+C 
(2x? +1)\/4a2 — 1/24+C 
cosxz+asinz+C 

x’ sing —2sing + 2xcosx+C 
(2 — le? +C 

(1/2)e” +C 

(2/2) — sin(2x)/4+C = 

(x/2) — (sina cosx)/2+C 
rie =7C 

(x? arctan a + arctanx — x)/2+C 


—2° cosx + 3x7 sinx + 6rcosx — 


6sinz+C 
3 


x’ sina + 3x” cosx — 6rsinx — 
6cosx+C 

x? /4—(cos? x) /4—(a sin x cos x) /2+ 
C 

«/4—(x cos” x)/2+(cosxsinz)/4+ 
C 

x arctan(./x)+arctan(./r) —./2+C 
2sin(./x) — 2,/z cos(./z) + C 

sec x csc x — 2cotxr+C 

—In|z — 2|/44+In|x+2|/4+C 
—a?/3 — 4a — 4In|x — 2|+ 

A4ln |a+ 2|+C 

—1/(e+5)+C 

—2 —In|x —2|4+In|a+2/+C 
—4x + #3/3 + 8 arctan(x/2) + C 
(1/2) arctan(x/2 + 5/2) +C 

a? /2—2in(4+27) +C 

(1/4) In |a + 3] — (1/4) Injxe +7|+C 


8.5.9. 
8.5.10. 
8.6.1. 
8.6.2. 
8.6.3. 
8.6.4. 
8.6.5. 


8.6.6. 


8.6.7. 
8.6.8. 
8.6.9. 
8.6.10. 


8.7.1. 


8.7.2. 


8.7.3. 


8.7.4. 


8.7.5. 
8.7.6. 


8.7.7. 


8.7.8. 


8.7.9. 


8.7.10. 
8.7.11. 


8.7.12. 


(1/5) In |2a — 3] — (1/5) In|jl+2|/+C 

(1/3) In |a| — (1/3) In |x + 3) +C 

TS: 4+£0 

T: 9.28125 + 0.281125; S$: 9+0 

T: 60.75 + 1; S: 60 +0 

T: 1.1167+0.0833; S: 1.1000+0.0167 

T: 0.3235 + 0.0026; S: 0.3217 + 

0.000065 

T: 0.6478 + 0.0052; S: 0.6438 + 

0.000033 

T: 2.8833+0.0834; S: 2.9000+0.0167 

T: 1.1170+0.0077; S: 1.1114+0.0002 

T: 1.097 + 0.0147; S: 1.089 + 0.0003 

T: 3.63 + 0.087; S: 3.62 + 0.032 

(t+4)4 

4 

(i = 9 )P/2 
5 

(ec +16)? 
4 


ae 


+C 


+C 


2 
cost — zoos t+C 


tan? t 
2 
In|? +¢+ 3) +C 


+C 


1 
sini —4/?|+C 


1 

5p tan(arcsin(t/5)) +C = 
a2 
25/25 — t? 
2 

3 sin 3t ++C 
ttant + In|cost|+C 


2Vet +14+C 
3t + sin 2t rs sin 4¢ 
8 4 32 


+C 


+C 


8.7.13. 


8.7.14. 


8.7.15. 


8.7.16. 


8.7.17. 


8.7.18. 


8.7.19. 


8.7.20. 


8.7.21. 


8.7.22. 


8.7.23. 


8.7.24. 


8.7.25. 


8.7.26. 


8.7.27. 


8.7.28. 


SAL. 
9.1.2. 
9.1.3. 
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In|t} Injé+3 
nll Inlt+31 


3 3 
-1 
a Oa a aie 
sin arctan t 
—l 
————.~ +C 
2(1 + tant)? = 
2 5/2 2 3/2 
Cre Gaye. 
5 3 
Pedi — 
e’sint — e’ cost 4C 
2 
$3/2 4 4 
slats et ae 
6 
2 1 
— C 
3(2 — #2)3/2 (2 —#2)1/2 + 
In| si tan(2t/3 
n[sin(arctan\ /3))| ; 
l 2) 2 
n(4t“) — In(9 + 4t*) We, 
18 
2 
(arctan(2t)) We 
4 
3ln|t+3]  Injé-1| 
1 + 1 +C 
cos’t cos’ t 
C 
rg 5 a 
-1 
—— +C 
t—3 
-1 
Pui)? a ee 
ee ries 
(e? —3t? + 6t —6)e’+C 
BY tales ron ee 
SV in(ar-+ 1+ V5) +C 
8/2/15 
1/12 
9/2 
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9.1.4. 
9.1.5. 
9.1.6. 
Pe iy 
9.1.8. 
9.1.9. 
9.1.10. 
9.1L. 
9.1.12. 
9.2.1. 
9.2.2. 
9.2.3. 
9.2.4. 
9.2.5. 
9.2.6. 
9.2.7. 
9.2.8. 


9.24. 


9.2.10. 


9.2.11. 


9.3.5. 
9.3.6. 
9.3.7. 
9.3.8. 


9.3.9. 
9.3.11. 


4/3 

2/3 —2/n 

3/n — 3V3/(2m) — 1/8 
1/3 

10V5/3 — 6 

500/3 


1/6 

1/n, 5/7 

0, 245 

20, 28 

(3 —1)/(2m), (18 — 12V3 + 2)/(47) 
10/49 meters, 20/49 seconds 

45/98 meters, 30/49 seconds 
25000/49 meters, 1000/49 seconds 
s(t) = cost, v(t) = —sint, 
maximum distance is 1, 


maximum speed is 1 

s(t) = —sin(at)/x? + t/7, 

u(t) = —cos(rt)/m + 1/n, 
maximum speed is 2/7 

s(t) = t?/2 —sin(rt)/n* + t/a, 
u(t) =t — cos(nt)/m +1/n 

s(t) = t?/2 + sin(xt)/x? — t/a, 
u(t) =t+cos(at)/m —1/a 
87/3 

1/30 

n(a/2 —1) 

(a) 1147/5 (b) 7472/5 (c) 207 
(d) 4x 

16m, 247 

th? (3r — h)/3 


9.3.13. 
9.4.1. 
9.4.2. 
9.4.3. 
9.4.4. 
9.4.5. 
9.4.6. 
9.5.1. 
9.5.2. 
9.5.3. 
9.5.4. 
9.5.5. 
9.5.6. 
9.5.7. 
9.5.8. 
9.5.9. 

9.5.10. 
9.6.1. 
9.6.2. 
9.6.3. 
9.6.5. 
9.6.6. 
9.6.7. 
9.6.8. 
9.6.9. 

9.6.10. 

9.6.11. 

9.6.12. 

9.6.13. 
9.7.1. 
9.7.2; 
9.7.3. 


27 

2/1; 2/7; 0 

4/3 

1/A 

1/4 

Bah 

—4,/1224 ft/s; —8V/1224 ft/s 
& 5, 305, 028,516 N-m 

~ 4,457, 854,041 N-m 
367, 5007 N-m 

490007 + 196000/3 N-m 
24507 N-m 

0.05 N-m 

6/5 N-m 

3920 N-m 

23520 N-m 

12740 N-m 

15/2 


diverges 


9.7.4. 
9.7.5. 
9.7.6. 
9.7.7. 
9.7.8. 
9.7.9. 
9.7.10. 
9.7.11. 
9.7.12. 
9.7.13. 
9.7.14. 


9.8.2. 


9.8.3. 


9.8.4. 
9.8.5. 
9.8.9. 
9.9.1. 
9.9.2. 
9.9.3. 
9.9.4. 
9.9.6. 
9.9.7. 
9.9.8. 
9.9.9. 


9.10.1. 


9.10.3. 


9.10.4. 


diverges 

1 

diverges 

2 

diverges 

7/6 

diverges, 0 

diverges, 0 

diverges, no CPV 

1 

80 mph: 90.8 to 95.3 N-m 
90 mph: 114.9 to 120.6 N-m 
100.9 mph: 144.5 to 151.6 N-m 
pa Leve=1fe 

(b —a) 


p= (at+db)/2,0= WE 


7/2 
21/2 

r=6 

(22/22 — 8)/27 

In(2) + 3/8 

ata?®/3 

In((v2 + 1)/v3) 

3/4 

& 3.82 

x= 1.01 
Ji+e2—V241—-n(V1t+e2+ 
1) +In(v2 +1) 


1672 
813 — we 


7307/730 = 107/10 
oF oF 


w+ 27e + : re? - 
4 Ade? e€ 


9.10.6. 


9.10.7. 


9.10.8. 


10.1.2. 
10.1.3. 
10.1.4. 
10.1.5. 
10.1.6. 
10.1.7. 
10.1.8. 
10.1.9. 
10.1.10. 
10.1.11. 
10.1.12. 
10.1.21. 
10.1.22. 
10.1.23. 
10.1.24. 
10.2.1. 


10.2.2. 


10.2.3. 


10.2.4. 
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81? 
81? 
20 
3V3 
a> b: 2nb?+ 
Qna2b 
———— arcsin( Va? — b?/a), 
aaa 
a <b: Qrb?7+ 
Qna2b In b “: b2 — a2 
b? — a? a a 


6 = arctan(3) 


r = —4csc0 


r? cos@ sin? @ = 1 


r=v5 


r? = sin @ sec? 6 


rsin@ = sin(r cos @) 
r = 2/(sin0@ — 5cos6@) 
r = 2sec0 


2 


=r’ cos?6—rsind+1 


jo) 


0 = 3r? cos? 6 — 2rcos@ — rsind 
r=sine 
(a? +. y7)* = 4a7y — (2* + y")y 
(x? yh = y? 
a? a? = ay? 
Payee 
(A cos @ + sin @)/(—@sin 6 + cos 8), 
(6? + 2) /(—@sin 6 + cos 6)? 

cos 6 + 2sin 6 cos 6 
cos? @ — sin? @ — sin@’ 

3(1 + sin @) 

(cos? 6 — sin? 6 — sin 0)3 
(sin? 6 — cos? 6) /(2sin @ cos 8), 
—1/(4sin? 0 cos? 0) 

2sin 8 cos 0 2 


. p) . 
cos? @ — sin? 0’ (cos? @ — sin? 6)3 
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10.2.5. undefined 10.5.1. There is a horizontal tangent at all 
Oe 2sin @ — 3sin? 0 multiples of 7. 
3cos? 6 — 2cos0 10.5.2. 97/4 
3.cos* @ — 3.cos*@ +2 . 
a i 
2 cos 6(3 cos? @ — 2)8 10.5.3. i ~V/5 — Acost dt 
10.3.1. 1 0 2 


10.3.2. 97/2 10.5.4. Four points: 


10.3.3. 3/3 SESE pe Ne . 


10.3.4. 7/12 + V3/16 


10.3.5. a2 —34+3V5 4. /5+ V5 
10.3.6. 4171/2 =: : 
10.3.7. 2— 1/2 10.5.5. 1171/3 

10.3.8. 1/12 10.5.6. 32/3 

10.3.9. 37/16 10.5.7. 27 


10.3.10. 7/4 —3V3/8 
10.3.11. 1/2 +3V3/8 
10.3.12. 1/2 

10.3.13. 3/2—7/4 


10.5.8. 16/3 
10.5.9. (7/2, 1) 
10.5.10. 57°/6 


- 10.5.11. 27°? 
10.3.14. 7/3 + V3/2 
Ree 10.5.12. (20/4n2 +1 + In(2m + 
10.3.15. te — /3/4 Jie +1))/2 
10.3.16. 47° /3 Gad A 
10.3.17. 1° 
11.1.3. 0 
10.3.18. 57/24 — V3/4 
11.1.4. 1 
10.3.19. 77/12 — V3 
11.1.5. 1 
10.3.20. 42 — V15/2 — 7arccos(1/4) ree ar 
We Yea; fa: lim n?/(2n? +1) =1/2 
anit t 2.1. jim 7 m+lj= 
10.4.6, 2=1- Oy 21 89 7 
11.2.2. lim 5/(2/" +414) =1/3 
10.4.7. x = 4cost — cos(4t), n—¥00 


y = Asint — sin(4t) 
10.4.8. x = 2cost + cos(2t), 
y = 2sint — sin(2t) verges 
10.4.9. 2 =cost +tsint, 11.2.4. —3/2 
y =sint — tcost 11.2.5. 11 


1 
11.2.3. 5°, + diverges, so So 3- di- 
n 


n=1 


11.2.6. 
11.2.7. 
11.2.8. 
11.2.9. 
11.3.1. 
11.3.2. 
11.3.3. 
11.3.4. 
11.3.5. 
11.3.6. 
11.3.7. 
11.3.8. 
11.3.9. 
11.3.10. 
11.3.11. 
11.3.12. 
11.4.1. 
11.4.2. 
11.4.3. 
11.4.4. 
11.4.5. 
11.4.6. 
11.5.1. 
11.5.2. 
11.5.3. 
11.5.4. 
11.5.5. 
11.5.6. 
11.5.7. 
11.5.8. 
11.5.9. 
11.5.10. 


20 

3/4 

3/2 

3/10 
diverges 
diverges 
converges 
converges 
converges 
converges 
diverges 
converges 
N=5 
N=10 
N = 1687 
any integer greater than e?2° 
converges 
converges 
diverges 
converges 
0.90 

0.95 
converges 
converges 
converges 
diverges 
diverges 
diverges 
converges 
diverges 
converges 


diverges 


11.6.1. 
11.6.2. 
11.6.3. 
11.6.4. 
11.6.5. 
11.6.6. 
11.6.7. 
11.6.8. 
11.7.5. 
11.7.6. 
11.7.7. 
11.7.8. 
11.8.1. 
11.8.2, 
11.8.3. 
11.8.4. 
11.8.5. 
11.8.6. 
11.9.1. 


11.9.2. 


11.9.3. 


11.9.4. 


11.9.5. 


11.10.1. 


11.10.2. 


Appendix A Selected Answers 


converges absolutely 
diverges 

converges conditionally 
converges absolutely 
converges conditionally 
converges absolutely 
diverges 

converges conditionally 
converges 

converges 

converges 

diverges 
R=1,I=(-1,1) 
R= co, 1 = (—co,'00) 
S67 = (exe) 
R=e, I =(2-e,2+¢e) 


R=0, converges only when x = 2 


Roh 6 


the alternating harmonic series 


So (nt 1x” 

n=0 

So (n+ 1)(n + 2)a” 

n=0 

~ (n+1)(n+2) " x 

d 5 cee 4 fae 
CO —1 sk 

Ct 2 Ganga? : 

S°(-1)"2""/(2n)!, R = co 

n=0 


CO 
S- onl, R= oo 
n=0 
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ya 11.12.10. converges 
1.10.3. xe aa es 
- 11.12.11. converges 
i | 11.12.12. converges 
1.10.4. S0(-1)"" eS psa 
a5 nm 11.12.13. converges 
oo 11.12.14. converges 
\er2 it (x = = fat 
eae: ea) gm? = 2 41.12.15. converges 
00 11.12.16. converges 


11.10.6. $°(-1)"(n+1)(@-1)",R=1 44.42.17. averse 
n=0 
ee ee 11.12.18. (—co, co) 
n— n 
11.10.7. +> = a” = 11.12.19. (—3,3) 
11.12.20. (—3,3) 
(2n — 1)! a _ 
1+ 5m get >) adden. Gia) 


11.12.22. radius is 0—it converges only when 


11.10.8. 2+ 2°/3 


t=) 
11.10.9. So (-1)"2t"t1/(2n)! 11.12.23. (—v3, V3) 
n=0 11.12.24. (—co, 0) 
he tn °° (In(2))” 
11.10.10. 5 §(-1)"“"*1/n! 11.12.25. 5) ey ae 
a — nt 
evade Sel gaa 
fy 2 1 94 790 ig) 11.12.26. > aaa" 
11.11.2. 1000; 8 toe 
3 5 — 2 2n+1 
11.11.38. 2+ > + =, error £1.27. 11.12.27. S 7 Se fs 
n=0 
11.12.1. diverges 11.12.28. 1 + 2/2 + 
11.12.2. converges ee ‘a 
2° ay) 
11.12.3. converges n=2 
11.12.4. diverges 11.12.29. 3. (-1)"2" 
11.12.5. diverges n=0 
12.6. di = (—1)” 
11.12.6. diverges 11.12.30. ys gts 
11.12.7. converges n=0 at: 
11.12.8. converges — 1 
11.12.31. r= —1)” 
rie Do rar 


11.12.9. converges n=0 


B 


Useful Formulas 


Algebra 
Remember that the common algebraic operations have precedences relative to each other: 


for example, multiplication and division take precedence over addition and subtraction, but 
are “tied” with each other. In the case of ties, work left to right. This means, for example, 
that 1/272 means (1/2)x: do the division, then the multiplication in left to right order. It 
sometimes is a good idea to use more parentheses than strictly necessary, for clarity, but 


it is also a bad idea to use too many parentheses. 
. 2 
Completing the square: x? + br +e=(x+ 8)? -F +e. 


—b+ Vb? — 4ac 


2a 


Quadratic formula: the roots of ax? + ba + ¢ are 


Exponent rules: 


a:a=a 
b 
a p= 
oo 
(a®)° = a® 


Geometry 
Circle: circumference = 27r, area = 1r?. 


Ellipse: area = mab, where 2a and 2b are the lengths of the axes of the ellipse. 
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Sphere: vol = 4rr3/3, surface area = 4rr?. 
Cylinder: vol = mr?h, lateral area = 27rh, total surface area = 2mrh 4+ 2rr?. 


Cone: vol = mr*h/3, lateral area = arv/r2 + h2, total surface area = mrvVr? + h? + 


rr. 


Analytic geometry 


Point-slope formula for straight line through the point (x0, yo) with slope m: y = 
Yo + m(ax — Zo). 
Circle with radius r centered at (h,k): (x —h)? + (y—k)? =r. 


2 2 


x 
Ellipse with axes on the z-axis and y-axis: — + a =e 
a 


Trigonometry 


cos(@ + 7) = —cos(6) 


Law of cosines: a? = b? + c? — 2bcecos A 
_ b a at 
snA sinB sinC 


Sine of sum of angles: sin(x + y) = sinxcosy + coszsiny 


Law of sines: 
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Sine of double angle: sin(2x) = 2sinx cos x 
Sine of difference of angles: sin(x — y) = sinx cosy — cosx sin y 


Cosine of sum of angles: cos(x + y) = cosx cosy — sinzsiny 


2 2 


Cosine of double angle: cos(2x) = cos? « — sin? « = 2cos? x — 1 = 1-— 2sin? x 


Cosine of difference of angles: cos(x — y) = cosxcosy + sinx sin y 


tanz + tany 


Tangent of sum of angles: tan(x + y) = 4 ; 
—tanxtany 


sin?(9) and cos?(@) formulas: 


sin?(0) + cos?(0) = 1 
tan?(9) + 1 = sec?(6) 


2 
aro cos(26) 
2 
1 
cos?(@) = + cos(20) 


y) 
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Index 


A completing the square, 313 
composition of functions, 25, 43, 65 
absolute extremum, 117 concave down, 112 
algebraic precedence, 313 concave up, 112 
alternating harmonic series, 271 cone 
antiderivative, 153 lateral area, 314 
arc length, 230 surface area, 314 
arccosine, 94 volume, 314 
arcsine, 92 continuous, 53 
area convergent sequence, 257 
between curves, 189 convergent series, 262 
under curve, 149 coordinates 
asymptote, 21, 113 Cartesian, 239 
converting rectangular to polar, 240 
B polar, 239 
rectangular, 239 
bell curve, 224 cosines 
bounded function, 52 law of, 314 
critical value, 106 
C cumulative distribution function, 224 
curve sketching, 105 
cardioid, 240 cycloid, 249 
Cartesian coordinates, 239 cylinder 
Cauchy Principal Value, 218 lateral area, 314 
center of mass, 213 surface area, 314 
centroid, 214 volume, 314 
chain rule, 65 
chord, 30 D 
circle 
area, 313 definite integral, 157 
circumference, 313 dependent variable, 22 
equation of, 19, 314 derivative, 32, 46 
unit, 19 dot notation, 130 
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Leibniz notation, 47 Hooke’s Law, 209 
second, 110, 111 hyperbolic cosine, 100 
difference quotient, 30 hyperbolic sine, 100 
differentiable, 53 hypercycloid, 250 
differential, 142 hypocycloid, 250 
discrete probability, 221 
divergence test, 264 T 
divergent sequence, 257 
divergent series, 262 implicit differentiation, 87 
domain, 21 implicit function, 88 
dot notation, 130 improper integral, 217 
convergent, 217 
E diverges, 217 
indefinite integral, 157 
ellipse independent variable, 22 
area, 313 inflection point, 112 
equation of, 314 integral 
ellipsoid, 237 improper, 217 


indefinite, 157 

of secxz, 173 

of sec? x, 173 
properties of, 161 


error estimate, 183 
escape velocity, 220 
exp function, 83 

expected value, 222 ; : 
exponential distribution, 225 integral sign, 153 
exponential function, 80 integral test, 268 


Extreme Value Theorem, 119 integration 
by parts, 174 


Intermediate Value Theorem, 53 
interval of convergence, 282 
inverse function, 80 

inverse sine, 92 

involute, 250 


F 


Fermat’s Theorem, 106 
frustum, 233 
function, 20 


bounded, 52 
differentiable, 53 K 
implicit, 88 kinetic energy, 219 
linear, 20 
rational, 178 L 
unbounded, 52 
function composition, 25 L’Hopital’s Rule, 96 
Fundamental Theorem of Algebra, 109 lateral area of a cone, 128 
Fundamental Theorem of Calculus, 151 law of cosines, 314 
law of sines, 314 
G Leibniz notation, 47 
limit, 39 
Gabriel’s horn, 221, 236 limit at infinity, 96 
geometric series, 262 limit of a sequence, 257 
global extremum, 117 linear approximation, 141 
greatest integer, 108 linearity of the derivative, 58 
local extremum, 105 
H local maximum, 105 
local minimum, 105 
harmonic series, 265 logarithm, 80 


alternating, 271 logarithmic function, 80 


long division of polynomials, 180 


M 


Maclaurin series, 286 
mean, 222, 225 

Mean Value Theorem, 144 
moment, 213 


N 


Newton, 208 
Newton’s method, 137-139 
normal distribution, 230 


O 


one sided limit, 44 
optimization, 117 


P 


p-series, 268 
parametric equations, 248 
partial fractions, 178 
physicists, 130 
point-slope formula, 314 
polar coordinates, 239 
power function, 55 
power rule, 55, 87, 90 
precedence 
of algebraic operations, 313 
probability density function, 223 
product rule, 60, 61 
generalized, 62 
properties of integrals, 161 


Q 


quadratic formula, 313 
quotient rule, 62 


R 


radian measure, 71 

radius of convergence, 282 
random variable, 222 
rational function, 64, 178 
rectangular coordinates, 239 
related rates, 129 

Rolle’s Theorem, 144 


Index 


S 


second derivative, 110, 111 
sequence, 256 

bounded, 260 

bounded above, 260 
bounded below, 260 
convergent, 257 
decreasing, 260 
divergent, 257 

increasing, 260 
monotonic, 260 
non-decreasing, 260 
non-increasing, 260 

of partial sums, 262 
series, 256 

p-series, 268 

absolute convergence, 276 
alternating harmonic, 271 
conditional convergence, 276 
convergent, 262 
divergent, 262 

geometric, 262 

harmonic, 265 

integral test, 268 

interval of convergence, 282 
Maclaurin, 286 

radius of convergence, 282 


Taylor, 288 
Simpson’s Rule, 185 
sines 

law of, 314 
sphere 


surface area, 314 
volume, 314 
spiral of Archimedes, 241 
squeeze theorem, 75 
standard deviation, 227 
standard normal distribution, 225 
standard normal probability density 
function, 224 
subtend, 72 
sum rule, 59 


T 


tangent line, 30 

Taylor series, 288 

Toricelli’s trumpet, 221, 236 
torque, 211 

torus, 236 

transcendental function, 71 
Trapezoid Rule, 183 
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triangle inequality, 41 
trigonometric identities, 314 


U 


unbounded function, 52 

uniform distribution, 224 

uniform probability density function, 224 
unit circle, 19 


V 


variance, 227 
velocity, 35 


Ww 


witch of Agnesi, 64 
work, 207 


